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\Q (54) Title: METHODS FOR PRODUCING SULPHUROUS FINE CHEMICALS 
00 

P (54) Bezeichnung: VERFAHREN ZUR HERSTELLUNG VON SCHWEFELHALTIGEN FEINCHEMIKALIEN 

^ (57) Abstract: The invention relates to methods for producing sulphurous fine chemicals, in particular L-methionine, by fermenta- 

f*} lion, using bacteria, in which a nucleotide sequence that codes for a methionine synthase (metF) gene is expressed. 

O 

Q (57) Zusammenfassung: Die Erfindung betrifft Verfahren zur fermentativen Herstellung von schwefelhaltigen Feinchemikalien, 
insbesondere L-Methionin, unter Verwendung von Bakterien, in denen eine fur ein Methionin-Synthase (meiF)-Gen kodierende Nu- 
kleotidsequenzen exprimiert wird. 
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Verfahren zur Herstellung von schwefeihaltigen Feinchemlkalien| 

Beschreibunq 

Gegenstand der Erfindung ist ein Verfahren zur fermentativen Herstellung von schwefeihaltigen 
Feinchemikalien, insbesondere L-Methionin, unter Verwendung von Bakterien, in denen eine ftir 
ein Methionin-Synthase (metH)-Gen kodierende Nukleotidsequenz exprimiert wird. 

Stand derTechnik 

Schwefelhaltige Feinchemikalien, wie zum Beispiel Methionin, Homocystein, S-Adenosy(- 
Methionin, Glutathion, Cystein, Biotin, Thiamin, LiponsSure werden Qber natQrliche Stoffwech- 
selprozesse in Zellen hergestellt und werden in vielen Industriezweigen verwendet, einschlieB- 
lich der Nahrungsmitteh Futtermittel-, Kosmetik- und pharmazeutischen Industrie* Diese Sub* 
stanzen, die zusammen als "schwefelhaltige Feinchemikalien'' bezeichnet werden, umfassen 
organische SSuren, sowohl proteinogene als auch nicht-proteinogene Aminosduren, Vitamine 
und Cofaktoren. Ihre Produktion erfolgt am zweckmSBigsten im GroBmaBstab mittels Anzucht 
von Bakterien, die entwickelt wurden, urn groBe Mengen der jeweils gewQnschten Substanzzu 
produzieren und sezemieren. Fur diesen Zweck besonders geeignete Organismen sind coryne- 
forme Bakterien, gram-positive nicht-pathogene Bakterien. 

Es ist bekannt, dass AminosSuren durch Fermentation von Stammen coryneformer Bakterien, 
insbesondere Corynebacterium glutamicum, hergestellt werden. Wegen der groBen Bedeutung 
wird standig an der Verbesserung der Herstellverfahren gearbeitet Verfahrensverbesservngen 
kdnnen femnentationstechnische MaBnahmen, wie zum Beispiel RQhrung und Versorgung mit 
Sauerstoff , oder die Zusammensetzung der NShrmedien, wie zum Beispiel die Zuckertonzentra- 
tion wShrend der Fermentation, oder die Aufarbeitung zum Produkt , beispielsweise durch lone- 
naustauschchromatographie, oder die intrinsischen Leistungseigenschaften des Mikroorganis- 
mus selbst betreffen. 

Ober Stammselektion sind eine Reihe von MutantenstSmmen entwickelt worden, die ein Sorti- 
mentwQnschenswerterVerbindungen aus der Reihe der schwefeihaltigen Feinchemikalien pro- 
duzieren. Zur Verbesserung der Leistungseigenschaften dieser Mikroorganismen hinsichtlich der 
Produktion eines bestimmten Molekuls werden Methoden der Mutagenese, Selektion und Mut- 
antenauswahl angewendet. Dies ist jedoch ein zeitaufwendiges und schwieriges Verfahren. Auf 
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diese Weise erhalt man z.B. Stamme, die resistent gegen Antimetabolite Oder Hemmstoffe, wie 
z. B. die Methionin-Analoga a-Methyl-Methionin, Ethionin, Norteucin, N-Acetyfnorleucin. S- 
Trifluoromethylhomocystein, 2-Amino-5-heprenoitsaure, Seleno-Methionin, Methioninsulfoximin, 
Methoxin, 1-Aminocyclopentan-Carboxylsaure oder auxotroph fQr regulatorisch bedeutsame 
5 Metabolite sind und schwefelhaltige Feinchemikalien, wie z. B. L-Methionin, produzieren. 

Seit einigen Jahren werden ebenfalls Methoden der rekombinanten DNA-Technik zur Stamm- 
verbesserung von L-Aminosaure produzierender Stamme von Corynebacterium eingesetzt, in- 
dem man einzelne Aminosaure-Blosynthesegene amplifiziert und die Auswirkung auf die Amino- 
10 sSure-Produktion untersucht. 

Die WO-A-02/10209 beschreibt ein Verfahren zur fermentativen Herstellung von L-Methionin 
unter Verwendung L-Methionin produzierender coryneformer Bakterien, worin wenigstens das 
metH-Gen Oberexprimiert ist t sowie die kodierende metH-Sequenz aus C. glutamicum ATCC 
15 13032. 

Kurze Beschreibuno der Erfinduno 

Der Erfindung lag die Aufgabe zugrunde, ein neues Verfahren zur verbesserten fermentativen 
20 Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, bereitzustellen. 

Gelfist wird obige Aufgabe durch Bereitstellung eines Verfahrens zur fennentativen Herstellung 
einer schwefelhaltigen Feinchemikalie, umfassend die Expression einer heterologen Nukleotid- 
sequenz, welche fQr ein Protein mit metH-Aktivitat kodiert, in einem coryneformen Bakterium. 

25 

Ein erster Gegenstand der Erfindung ist ein Verfahren zur fermentativen Herstellung wenigstens 
einer schwefelhaltigen Feinchemikalie, welches folgende Schritte umfasst 

a) Fermentation einer die gewunschte schwefelhaltige Feinchemikalie produzierenden co- 
ryneformen Bakterienkultur, wobei in den coryneformen Bakterien zumindest eine hetero- 

30 loge Nukleotidsequenz exprimiert wird, welche fQr ein Protein mit Methionin-Synthase 

(metH)-Aktivitat kodiert; 

b) Anreicherung der schwefelhaltigen Feinchemikalie im Medium Oder in den Zellen der 
Bakterien, und 



i 
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c) Isolieren der schwefelhaltigen Feinchemikalie, welche vorzugsweise L-Methionin um- 
fasst 

Vorzugsweise besitzt obige heterologe metH-kodierende Nukleotidsequenz zur metH- 
kodierenden Sequenz aus Corynebacterium glutamicum ATCC 13032 eine Sequenzhomologie 
vom weniger als 70% aufweist Die metH-kodierende Sequenz ist vorzugsweise aus einem der 
folgenden Organismen von Liste I abgeleitet 



Liste I 



Streptomyces coelicolor 


ATCC 10147 


Anabaena sp. 


ATCC 27892 


Synechocystis sp. 


ATCC 27184 


Prochlorococcus marinus 


PCC7118 


Themnus thermophilus 


ATCC 27634 


Bacillus halodurans 


ATCC 21591 


Bacillus stearothenmophilus 


ATCC 12980 


Vibrio cholerae 


ATCC 39315 


Sinorhizobium meliloti 


ATCC 4399 


Escherichia coli K12 


ATCC55151 


Salmonella typhimurium 


ATCC 15277 


Salmonella typhi 


ATCC 12839 


Pseudomonas fluorescens 


ATCC 13525 


Pseudomonas aeruginosa 


ATCC 17933 


Nitrosomonas europeae 


ATCC 19718 


Bordetella pertussis 


ATCC 9797 


Clorobium tepidum 


ATCC 49652 


Deinococcus radiodurans 


ATCC 13939 


Clostridium acetobutylicum 


ATCC 824 


Caulobacter crescentus 


ATCC 19089 


Homo sapiens 




Vibrio fischeri 


ATCC 33715 


Agrobacterium tumefaciens str. C58 (Cereon) 


ATCC 33970 


Ralstonia solanacearum 


ATCC 25237 



ATCC: American Type Culture Collection, Rockville, MO, USA 
PCC: Pasteur Culture Collection of Cyanobacteria. Paris Frankreich 

Die erfindungsgemaii eingesetzte metH-kodierende Sequenz umfasst voizugsweise eine kodie- 
rende Sequenz gemSli SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27, 29, 31 , 33, 35, 
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37, 39, 41, 43, 45, 47, 49 und 51 oder eine dazu homologe Nukleotidsequenz, welche fOr ein 
Protein mit metH-Aktivitat kodiert 

Die erfindungsgemaB eingesetzte metH-kodierende Sequenz kodiert auBerdem vorzugsweise 
5 fur ein Protein mit metH-Aktivitdt, wobei das Protein eine Aminosduresequenz gemaB SEQ ID 
NO:2, 4, 6, 8, 1 0, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 und 
52 oder eine dazu homologe Aminosiuresequenz, welche for ein Protein mit metH-Aktivitat 
steht, umfasst 

10 Die kodierende metH-Sequenz ist vorzugsweise eine in coryneformen Bakterien replizierbare 
oder eine stabil in das Chromosom intregrierte DNA oder eine RNA. 

GemSB einer bevorzugten AusfOhrungsform wird das erfindungsgemaBe Verfahren durchge- 
filhrt, indem man 

15 

a) einen mit einem Plasmidvektor transformierten Bakterienstamm einsetzt der wenigstens 
eine Kopie der kodierende n metH-Sequenz unter der Kontrolle regulativerSequenzen tr&gtoder 

b) einen Stamm einsetzt, in dem die kodierende metH-Sequenz in das Chromosom des 
Bakteriums integriert wurde. 

20 

Es ist weiterhin bevorzugt, die kodierende metH-Sequenz fOr die Fermentation zu Oberexprimie- 
ren. 

AuBerdem kann es wunschenswert sein, Bakterien zu fermentieren, in denen zusStzlich wenigs- 
25 tens ein weiteres Gen des Biosyntheseweges der gewOnschten schwefelhaltigen Feinchemikalie 
oder eines damit assoziierten Biosynthese- oder sonstigen Stoffwechselweges, verstarkt ist und 
/oder 

in denen wenigstens ein Stoflwechselweg zumindest teilweise ausgeschaltet sind, der die Bil- 
dung der gewOnschten schwefelhaltigen Feinchemikalie verringert. 

30 

AuBerdem kann es wOnschenswert sein, Bakterien zu fermentieren, in denen zusStzlich wenigs- 
tens ein weiteres Gen des Biosyntheseweges der gewOnschten schwefelhaltigen Feinchemikalie 
durch Stoffwechselmetabolite in seiner Aktivitat nicht in unerwOnschter Weise beeinflusst wird. 



( 
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GemSli einer weiteren AusfQhrungsform des erfindungsgemSIXen Verfahrens werden deshalb 
coryneforme Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgew3hlt 
unter 

a) dem fur eine Aspartatkinase kodierenden Gen lysC, 

b) dem fQr eine Aspartat-Semialdehyd-Dehydrogenase kodierenden Gen asd 

c) dem fQr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierenden Gen gap, 

d) dem fQr die 3-Phosphog!ycerat Kinase kodierenden Gen pgk, 

e) dem fQr die Pyruvat Carboxylase kodierenden Gen pyc, 

f) dem fQr die Triosephosphat Isomerase kodierenden Gen tpi, 

g) dem fur die Homoserin O-Acetyltransferase kodierenden Gen metA, 

h) dem fQr die Cystathionin-gamma-Synthase kodierenden Gen metB, 

i) dem fQr die Cystathionin-gamma-Lyase kodierenden Gen metC, 

j) dem fQr die Serin-Hydroxymethyltransferase kodierenden Gen glyA, 
k) dem fQr die O-Acetylhomoserin-Sulfhydrylase kodierenden Gen metY. 
I) dem fQr die Methylen-Tetrahydrofolat-Reduktase kodierenden Gen, metF 
m) dem fQr die Phosphoserin-Aminotransferase kodierenden Gen serC 
n) dem fur die Phosphoserin-Phosphatase kodierenden Gen serB, 
o) dem fQr die Serine Acetyl-Transferase kodierenden Gen cysE, 
p) dem fQr die Homoserin-Dehydrogenase kodierenden Gen horn, 
Qberexprimiert ist 

Gemall einer anderen AusfQhrungsform des erfindungsgemaBen Verfahrens werden corynefor- 
me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene ausgew3hlt unter 
Genen der oben genannten Gruppe a) bis p) mutiert ist, insbesondere so dass die korrespondie- 
renden Proteine, verglichen mit nicht mutierten Proteinen, in geringerem MaBe oder nicht durch 
Stoffwechselmetabolite in ihrer Aktivitat beeinflusst werden und dass insbesondere die erfin- 
dungsgemSlie Produktion der Feinchemikalie nicht beeintrSchtigt wind. Durch die Mutation kann 
das Protein auch eine hfihere Aktivitat (Sunstratumsatz) und/oder Sunstratspezifitat besitzen und 
damit die Produktion der gewunschten Feinchemikalie fOrdern. 
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dematJ einer anderen Ausfuhrungsform des erfindungsgemaBen Verfahrens werden corynefor- 
me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShlt unter 
q) dem fur die Homoserine-Kinase kodierenden Gen thrB, 
r) dem fOr die Threonin Dehydratase kodierenden Gen ilvA, 
s) dem fOr die Threonin Synthase kodierenden Gen thrC 
t) dem fQr die Meso-Diaminopimelat D-Dehydrogenase kodierenden Gen ddh 
u) dem fQr die Phosphoenolpyruvat-Carboxykinase kodierenden Gen pck, 
v) dem fOr die Glucose-6-Phosphat-6-lsomerase kodierenden Gen pgi, 
w) dem fQr die Pyruvat-Oxidase kodierenden Gen poxB, 
x) dem fQr die Dihydrodipicolinat Synthase kodiemden Gen dapA, 
y) dem fQr die Dihydrodipicolinat Reduktase kodiernden Gen dapB; Oder 
z) dem fQr die Diaminopicolinat Decarboxylase kodiernden Gen lysA 
abschwScht ist, insbesondere durch Verringerung der Expressionsrate des konespondierenden 
Gens, oder durch Expression eines Proteins mit geringerer Aktivitat (Substratumsatz). 

Gemaii einer anderen Ausfuhrungsform des erfindungsgemaBen Verfahrens werden corynefbr- 
me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene der obigen Gruppen 
q) bis z) mutiert ist, so dass die enzymatische Aktivitat des konespondierenden Proteins teilwei- 
se oder vollstdndig vemngert wird. 

Vorzugsweise werden in dem erfindungsgemaBen Verfahren Mikroorganismen der Art Coryne- 
bacterium glutamicum eingesetzt 

In einer weiteren Ausfuhrungsform des Verfahrens werden solche Mikroorganismen eingesetzt, 
die Resistenz gegenQber wenigstens einen Methionin-Biosynthesehemmer aufweisen. Solche 
Hemmer sind, ohne darauf beschrankt zu sein, Methionin-Analoga, wie a-Methyl-Methionin, 
Ethionin, Norieucin, N-Acetylnorieucin, S-Trifluoromethylhomocystein, 2-Amino-5- 
heprenoitsaure, Seleno-Methionin, Methioninsulfoximin, Methoxin, und 1-Aminocyclopentan- 
Carboxylsdure. 

Ein weiterer Gegenstand der Erfindung betrifft ein Verfahren zur Herstellung eines L-Methionin- 
haltigen Tierfuttermittel-Additivs aus Fermentationsbriihen, welches folgende Schritte umfasst 
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a) Kultivierung und Fermentation eines L-Methionin produzierenden Mikroorganis- 
mus in einem Fermentationsmedium; 

b) Entfemung von Wasser aus der L-Methionin haltigen Fermentationsbriihe; 

c) Entfemung der wShrend der Fermentation gebildeten Biomasse in einer Menge 
von 0 bis 100 Gew.-%; und 

d) Trocknung der gemafi b) und/oder c) erhaltenen Fermentationsbriihe, urn das 
Tierfuttermittel-Additiv in der gewOnschten Pulver- Oder Granulatform zu erhalten. 

Gegenstand der Erfindung sind ebenfalls die erstmalig aus obigen Mikroorganismen isolierten 
kodierenden metH-Sequenzen, die davon kodierten Methionin-Synthasen sowie die funktionalen 
Homologen dieser Polynukleotide bzw. Proteine. 

Gegenstand der Erfindung sind insbesondere auch die zur DurchfOhrung obiger Verfahren not- 
wendigen Expressionskonstrukte und Mikroorganismen. 

Weitere GegenstSnde der Erfindung sind somit insbesondere: 

- das Plasmid pCIS lysC thr31 1 He, kodierend fur lysC thr31 1 ile Oder ein f unktionales Aquivalent 
davon, d.h. eine lysC-Mutante mit vergleichbarer, gegenOber dem Wildtyp erhdhter Aspartatki- 
nase-Aktivitat; 

- ein Wirtsorganismus transformiert mit dem Plasmid pCIS lysC thr31 1 ile, insbesondere ausge- 
wahlt unter Mikroorganismen der Gattung Corynebacterium, insbesondere der Art C. glutami- 
cum, wie der transformierte Stamm LU1479 lysC 31 1ile; 

- das Plasmid pC Phsdh metH^Sc, kodierend fQr metH aus Streptomyces cx>elicolon 

- ein Wirtsorganismus gemSli obiger Definition, transformiert mit einem Plasmid, kodierend fflr 
exgonenes metH; insbesondere transformiert mit dem Plasmid pC Phsdh metH_Sc; 

- ein Wirtsorganismus gemaii obiger Definiiton mit Resistenz gegen wenigstens einen Methio- 
nin-Biosynthesehemmstoff, wie der transformierte Stamm LU 1479 lysC 311ile ET-16, gege- 
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benenfalls transformiert mit einer exogenen kodierenden metH Sequenz, wie der transformierte 
Stamm LU1479 lysC 31 1i!e ET-16 pC Phsdh metH_Sc. 

Detaillierte Beschreibung der Erfinduno 

a) Allgemeine Begriffe 

Als Proteine mit der biologischen Aktivitat der Methionine-Synthase, kurz auch als metH genannt 
(systematische Bezeichnung: 5-Methyltetrahydrofolat-Homocystein S-Methyltransferase ;EC 
2.1 .1 .1 3) t werden solche Proteine bezeichnet, die in der Lage sind Homocystein unter Verwen- 
dung der Cofaktoren 5-Methyltetrahydrofoiat (MTHF), Cobalamin (Vitamin B12) und S-Adenosyl- 
Methionin zu Methionin und Tetrahydrofolat umzusetzen. Wahrend der Cofaktor 5- 
l^ethyltetrahydrofolatstechiometrisch in die Reaktion miteingeht (1mol MTHFMMol Methionin 
gebildet) wird, wie in der Literatur beschrieben.S-Adenosyl-Methionin substOchiometrisch um- 
gesetzt Cobalamin hingegen ist katalytisch an der Umsetzung beteiligt. Dem Fachmann sind 
weitere Details des metH-Proteins bekannt. (Banerjee R.V., Matthews R.G. FASEB J. 4:1450- 
1459, 1990, Ludwig ML Matthews RG. Annual Review of Biochemistry. 66:269-313, 1997, 
Drennan CL. Matthews RG. Ludwig ML. Current Opinion in Structural Biology. 4:919-29, 1994). 
Der Fachmann unterscheidet die Aktivitat der Cobalamin-abhSngigen 5-Methyltetrahydrofolat- 
Homocystein S-Methyltransferase von der der Cobalamin-unabhSngigen 5-Methyltetrahydro- 
Reroyttriglutamat-Homocystein S-Methyltransferase (EC 2.1.1.14) auch metE genannt Der 
Fachmann kann die enzymatische Aktivitat von metH durch Enzymtests nachweisen, Vorschrif- 
ten dafflr kSnnen sein: Jarrett JT. Goulding CW. Fluhr K. Huang S. Matthews RG. Methods in 
Enzymology. 281:196-213, 1997. 

Im Rahmen der vorliegenden Erfindung umfasst der Begriff w schwefelhaltige Feinchemikalie" 
jegliche chemische Verbindung, die wenigstens ein Schwefelatom kovalent gebunden enthSIt 
und durch ein erfindungsgemaiies Fermentationsverfahrens zuganglich ist Nichtlimitierende 
Beispiele dafur sind Methionin, Homocystein, S-Adenosyl-Methionin, insbesondere Methio- 
nin.und S-Adenosyl-Methionin. 
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Im Rahmen der vorliegenden Erfindung umfassen die Begriffe L-Methionin, Methionin, Homo- 
cystein und S-Adenosylmethionin auch die korrespondierenden Salze, wie z. B. Methionin- 
Hydrochlorid oder Methionin-Sulfat 

"Polynukleotide" bezeichnet im allgemeinen Polyribonukleotide (RIMA) und Polydeoxyribonukleo- 
tide (DNA), wobei es sich urn nlcht modifizierte RNA oder DNA oder modifizierte RNA oder DNA 
handeln kann. 

Unter "Polypeptide^ versteht man erfindungsgemali Peptide oder Proteine, die zwei oder mehr 
Ober Peptidbindungen vertoundene AminosSuren enthalten. 

Der Begriff ..Stoffwechselmetabolit" bezeichnet chemische Verbindungen, die im Stoffwechsel 
von Organtsmen als Zwischen- oder auch Endprodukte vorkommen und die neben ihrer Eigen- 
schaft als chemische Bausteine auch modulierende Wirkung auf Enzyme und ihre katalytische 
Aktivitat haben kOnnen. Dabei ist aus der Literatur bekannt, dass solche Stoffwechselmetabolite 
sowohl hemmend als auch sfimulierend auf die Aktvitat von Enzymen wirken kBnnen (Bioche- 
mistiy, Stryer, Lubert, 1995 W. H. Freeman & Company, New York, New York.). In der Literatur 
ist auch beschrieben, dass es mdglich ist durch Malinahmen wie Mutation der genomischen 
DNA durch UV-Strahlung, ionisierender Strahlung oder mutagene Substanzen und nachfolgen- 
der Selektion auf bestimmte PhSnotypen in Organismen solche Enzyme zu produzieren, In de- 
nen die Beeinflussung durch Stoffwechselmetabolite verandert wurde (Sahm H. Eggeling L. de 
Graaf AA. Biological Chemisby 381(9.10):899-910, 2000; Eikmanns BJ. Eggeling L Sahm H. 
Antonie van Leeuwenhoek. 64:145^3, 1993-94). Diese verSnderten Eigenschaften kfinnen auch 
durch gezielte Malinahmen erreicht werden. Dabei ist dem Fachmann bekannt, in Genen fOr 
Enzyme auch gezielt bestimmte Nukleotide der fOr das Protein kodierenden DNA so zu vetfn- 
dem, dass das aus der exprimierten DNA-Sequenz resultierende Protein bestimmte neue Ei- 
genschaften aufweist. so zum Beispiel, dass die modulierende Wirkung von Stoffwechselmeta- 
boliten gegenOber dem nicht veranderten Protein verflndert ist 

Enzyme kdnnen derart in ihrer Aktivitat beeinflulit werden, dass es zu einer Vemngerung der 
Reaktionsgeschwindigkeit, oder zu einer Ver§nderung der Affinitat gegenuber dem Substrat 
oder zu einer Anderung der Reaktionsgeschwindigkeiten kommt 
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Die Begriffe "exprimieren" bzw. ■Verstarkung-oder.Oberexpression" beschreiben im Kontext der 
Erfindung die Produktion bzw. ErhChung der intrazellulSren Aktivitat eines Oder mehrerer Enzy- 
me in einem Mikroorganismus, die durch die entsprechende DNA kodiert werden. Dazu kann 
man beispielsweise ein Gen in einen Organismus einbringen, ein vortiandenes Gen durch ein 
anderes Gen ersetzen, die Kopienzahl des Gens bzw. der Gene erhdhen, einen starken Promo- 
tor verwenden Oder ein Gen verwenden, das for ein entsprechendes Enzym mit einer hohen 
Aktivitat kodiert und man kann gegebenenfalls diese Maftnahmen kombinieren. 

b) ErfindungsgemaBe metH-Proteine 

Erfindungsgemau mit umfasst sind ebenfalls .funktionale Aquivalente" der konkret offenbarten 
metH-Enzyme aus Organismen obiger Liste I. 

.Funktionale Aquivalente" oder Anaioga der konkret offenbarten Polypeptide sind im Rahmen 
der vorliegenden Erfindung davon verschiedene Polypeptide, welche weiterhin die gewOnschte 
biologische Aktivitat, wie z.B. Substratspezifitat, besitzen. 

Unter "funktionalen Aquivalenten" versteht man erfindungsgemaB insbesondere Mutanten, wel- 
che in wenigstens einer der oben genannten Sequenzpositionen eine andere als die konkret 
genannte Aminosaure aufweisen aber trotzdem eine der oben genannten biologischen Aktivita- 
ten besitzen. "Funktionale Aquivalente" umfassen somitdie durch eine oder mehrere Aminosau- 
re-Additionen, -Substitutionen, -Deletionen und/oder-lnversionen erhaitlichen Mutanten, wobei 
die genannten Veranderungen in Jeglicher Sequenzposition auftreten kdnnen, solange sle zu 
einer Mutante mit dem erfindungsgemafJen Eigenschaftsprofil fQhren. Funktionale Aquivaienz ist 
insbesondere auch dann gegeben, wenn die Reaktivitatsmuster zwischen Mutante und unver- 
andertem Polypeptid qualitativ Obereinstimmen, d.h. beispielsweise gleiche Substrate mit unter- 
schiedlicher Geschwindigkeit umgesetzt werden. 

Tunktionale Aquivalente" umfassen naturlich auch Polypeptide welche aus anderen Organis- 
men zuganglich sind, sowie natQrlich vorkommende Varianten. Beispielsweise lassen sich durch 
Sequenzvergleich Bereiche homologer Sequenzregionen festlegen und in Anlehnung an die 
konkreten Vorgaben der Erfindung aquivalente Enzyme ermitteln. 
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.Funktionale Aquivalente" umfassen ebenfalls Fragmente, vorzugsweise einzelne Domanen 
oder Sequenzmotive, der erfindungsgemilien Polypeptide, welche z.B. die gewflnschte biologi- 
sche Funktion aufweisen. 

5 .Funktionale Aquivalente* slnd aulierdem Fusionsproteine, welche ein der oben genannten Po- 
lypeptidsequenzen oder davon abgeleitete funktionale Aquivalente und wenigstens eine weitere, 
davon funktionell verschiedene, heterologe Sequenz in funktioneller N- oder C-terminaler Ver- 
knQpfung (d.h. ohne gegenseitigen wesentliche funktionelle Beeintnachtigung der Fusionsprote- 
inteile) aufweisen. Nichtlimitiemde Beispiele fOr derartige heterologe Sequenzen sind z.B. Sig- 
1 0 nalpeptide, Enzyme, Immunoglobuline, Oberfiachenantigene, Rezeptoren oder Rezeptorligan- 
den. 

ErfindungsgemaB mit umfasste B funktionale Aquivalente* sind Homologe zu den konkret offen- 
barten Proteinen. Diese besitzen, beispielsweise Ober die gesamte Lange, wenigstens 30%, 
1 5 oder etwa 40%, 50 %, vorzugsweise wenigstens etwa 60 %, 65%, 70%, oder 75% ins besondere 
wenigsten 85 %, wie z.B. 90%, 95% oder 99%, Homologie zu einer der konkret offenbarten Se- 
quenzen, berechnet nach dem Algorithmus von Pearson und Lipman, Proc. Natl. Acad, Sci. 
(USA) 85(8), 1 988, 2444-2448. Der Homologiegrad spiegelt insbesondere den Grad der Identitat 
zwischen veranderter und nicht veranderter Sequenz wider. 

20 

Homologe der erfindungsgemaden Proteine oder Polypeptide kdnnen durch Mutagenese er- 
zeugt werden, z.B. durch Punktmutation oderVerkurzung des Proteins. Der Begriff "Homolog*, 
wie er hier verwendet wird, betrifft auch eine variante Form des Proteins, die als Agonist oder 
Antagonist der Protein-Aktivitat wirkt. 

25 

Homologe des erfindungsgemaiien Proteine konnen durch Screening kombinatorischer Banken 
von Mutanten, wie z.B. VerkOrzungsmutanten, identifiziert werden. Beispielsweise kann eine 
variegierte Bank von Protein-Varianten durch kombinatorische Mutagenese auf Nukleinsaure- 
ebene erzeugt werden, wie z.B. durch enzymatisches Ligieren eines Gemisches synthetischer 
30 Oligonukleotide. Es gibt eine Vielzahl von Verfahren, die zur Herstellung von Banken potentieller 
Homologer aus einer degenerierten Oligonukleotidsequenz verwendet werden konnen. Die 
chemische Synthese einer degenerierten Gensequenz kann in einem DNA-Syntheseautomaten 
durchgefOhrt werden, und das synthetische Gen kann dann in einen geeigneten Expressions- 
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vektor ligiert werden. Die Verwendung eines degenerierten Gensatzes ermdglichtdie Bereitstel- 
lung samtlicher Sequen2en in einem Gemisch, die den gewOnschten Satzan potentielleri Prote- 
insequenzen codieren. Verfahren zur Synthese degenerierter Oligonukleotide sind dem Fach- 
mann bekannt(Z.B. Narang, SA (1983) Tetrahedron 39:3; Itakuraetal. (1984)Annu.Rev. Bio- 
5 chem. 53:323; Itakura et al., (1984) Science 198:1056; Ike et al. (1983) Nucleic Acids Res. 
11:477). 

Zusfltzlich k6nnen Banken von Fragmenten des Protein-Codons verwendet werden, um eine 
variegierte Population von Protein-Fragmenten zum Screening und zur anschlieBenden Selekti- 

1 0 on von Homologen eines erfindungsgemaBen Proteins zu erzeugen. Bei einer AusfOhrongsform 
kann eine Bank von kodierenden Sequenzfragmenten durch Behandeln eines doppelstrSngigen 
PCR-Fragmentes einer kodierenden Sequenz mit einer Nuklease unter Bedingungen, unter de- 
nen ein Nicking nur etwa einmal pro MolekOI erfolgt, Denaturieren der doppelstrSngigen DNA, 
Renaturieren der DNA unter Bildung doppelstrangiger DNA, die Sense-/Antisense-Paare von 

1 5 verschiedenen genickten Produkten umfassen kann, Entfemen einzelstrSngiger Abschnitte aus 
neu gebildeten Duplices durch Behandlung mit S1-Nuclease und Ligieren der resultierenden 
Fragmentbank in einen Expressionsvektorerzeugt werden. Durch dieses Verfahren kann eine 
Expressionsbank hergeleitet werden, die N-terminale, C-terminale und interne Fragmente mit 
verschiedenen GnSBen des erfidungsgemaBen Proteins kodiert 

20 

Im Stand der Technik sind mehrere Techniken zum Screening von Genprodukten kombinatori- 
scher Banken, die durch Punktmutationen Oder Verkurzung hergestelltworden sind, und zum 
Screening von cDNA-Banken auf Genprodukte mit einer ausgewShlten Eigenschaft bekannt 
Diese Techniken lassen sich an das schnelle Screening der Genbanken anpassen, die durch 

25 kombinatorische Mutagenese erfindungsgemSBer Homologer erzeugt worden sind. Die am hau- 
figsten verwendeten Techniken zum Screening groBer Genbanken, die einer Analyse mithohem 
Durchsatzunteriiegen, umfassen das Klonieren derGenbankin replizierbare Expressionsvekto- 
ren, Transformieren der geeigneten Zellen mit der resultierenden Vektorenbank und Exprimieren 
der kombinatorischen Gene unter Bedingungen, unter denen der Nachweis der gewOnschten 

30 Aktivitat die Isolation des Vektors. der das Gen codiert, dessen Produkt nachgewiesen wurde, 
erleichtert. Recursive-Ensemble-Mutagenese (REM), eine Technik, die die Haufigkeitfunktionel- 
ler Mutanten in den Banken vergrSBert, kann in Kombination mit den Screeningtests verwendet 
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werden, urn Homologe zu identifizieren (Arkin und Yourvan (1992) PNAS 89:781 1-7815; Delgra- 
ve et al. (1 993) Protein Engineering 6(3):327-331 

c) ErfindunQsaemafte Polvnukleotide 

5 

Gegenstand der Erfindung sind ebenso NukleinsSuresequenzen (einzel- und doppelstrangige 
DNA- und RNA-Sequenzen, wie z.B. cDNA und mRNA), kodierend fOreines der obigen metH- 
Enzyme und deren funktionalen Aquivalenten, welche z.B. auch unter Verwendung kOnstlicher 
Nukleotidanaloga zugSnglich sind. 

10 

Die Erfindung betrifft sowohl isoiierte NukleinsSuremoIekOle, welche fOr erfindungsgemSBe Po- 
lypeptide bzw. Proteine Oder biologisch aktive Abschnitte davon kodieren, sowie NukleinsSure- 
fragmente, die z.B. zur Verwendung ats Hybridisierungssonden oder Primer zur Identifizierung 
Oder Amplifizierung von erfindungsgemS&er kodierenden NukleinsSuren verwendet werden kdn- 
15 nen. 

Die erfindungsgemd&en NukleinsduremolekQIe kdnnen zudem untranslatierte Sequenzen vom 
3 - und/oder 5 -Ende des kodierenden Genbereichs enthalten 

20 Ein "isoliertes" NukleinsSuremolekul wind von anderen NukleinsauremolekOlen abgetrennt die in 
der natOriichen Quelle der Nukleinsaure zugegen sind und kann Qberdies im wesentlichen frei 
von anderem zellulSren Material oder Kulturmedium sein, wenn es durch rekombinante Technl- 
ken hergestellt wind, oder frei von chemischen Vorstufen oder anderen Chemikalien sein, wenn 
es chemisch synthetisiert wird. 

25 

Die Erfindung umfasst weiterhin die zu den konkret beschriebenen Nukleotidsequenzen kom- 
plementaren NukleinsauremolekOle oder einen Abschnitt davon. 

Die erfindungsgemaB Nukleotidsequenzen ermOglichen die Erzeugung von Sonden und Pri- 
30 mem, die zur Identifizierung und/oder Klonierung von homologer Sequenzen in anderen Zellty- 
pen und Organismen verwendbar sind. Solche Sonden bzw. Primer umfassen gewflhnlich einen 
Nukleotidsequenzbereich, der unter stringenten Bedingungen an mindestens etwa 12, vorzugs- 
weise mindestens etwa 25, wie z.B. etwa 40, 50 oder 75 aufeinanderfolgende Nukleotide eines 
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Sense-Stranges einer erfindungsgemaiJen NukleinsSuresequenz oder eines entsprechenden 
Antisense-Stranges hybridisiert. 

Weitere erfindungsgemflBe Nukleinsauresequenzen sind abgeleitet von SEQ ID NO:1, 3, 5, 7, 9, 
5 11, 13 f 15, 17, 19.21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 Oder 51 und unter- 
scheiden sich davon durch Addition, Substitution, Insertion oder Deletion einzelnerodermehre- 
ner Nukleotide, kodieren aber weitertiin fOr Polypeptide mit dem gewOnschten Eigenschaftsprofil. 
Dies kOnnen Polynukleotide sein, die zu obigen Sequenzen, beispielsweise uber die gesamte 
Lange, in mindestens etwa 50%, 55%, 60%, 65%, 70%, 80% oder 90%, vorzugsweise in min- 
1 0 destens etwa 95%, 96%, 97%, 98% oder 99% der Sequenzpositionen identisch sind. 

Erfindungsgemafi umfasst sind auch solche Nukleinsauresequenzen, die sogenannte stumme 
Mutationen umfassen oder entsprechend der Codon-Nutzung eins speziellen Ursprungs- oder 
Wirtsorganismus, im Vergteich zu einer konkret genannten Sequenz verSndert sind, ebenso wie 
1 5 natOrlich vorkommende Varianten, wie Z.B. SpleilSvarianten oder Allelvarianten, davon. Gegens- 
tand sind ebenso durch konservative Nukleotidsubstutionen (d.h. die betreffende Aminosaure 
wind durch eine Aminosaure gleicher Ladung, GnBBe, Polaritat und/oder Lflslichkeit ersetzt) er- 
haitliche Sequenzen. 

20 Gegenstand der Erfindung sind auch die durch Sequenzpolymorphismen von den konkret offen- 
barten NukleinsSuren abgeleiteten MolekOle. Diese genetischen Polymorphismen kOnnen zwi- 
schen Individuen innerhalb einer Population aufgrund der natOrlichen Variation existieren. Diese 
natOriichen Variationen bewirken Oblichenweise eine Varianz von 1 bis 5 % in der IMukleotidse- 
quenz eines Gens. 

Weiterhin umfasst die Erfindung auch Nukleinsauresequenzen, welchen mit oben genannten 
kodierenden Sequenzen hybridisieren oderdazu komplementSrsind. Diese Polynukleotide las- 
sen sich bei Durchmusterung von genomischen oder cDNA-Banken auffinden und gegebenen- 
falls daraus mit geeigneten Primem mittels PCR vermehren und anschliedend beispielsweise 
mit geeigneten Sonden isolieren. Eine weitere Moglichkeit bietet die Transformation geeigneter 
Mikroorganismen mit erfindungsgemaBen Polynukleotiden oder Vektoren, die Vermehrung der 
Mikroorganismen und damit der Polynukleotide und deren anschlieBende Isolierung. DarOber 
hinaus kfinnen erfindungsgemaBe Polynukleotide auch auf chemischem Wege synthetisiert 
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werden. 

Unter der Eigenschaft, an Polynukleotide .hybridisieren" zu kdnnen, versteht man die Fahigkeit 
eines Poly- oder Oligonukleotids unter stringenten Bedingungen an eine nahezu komplementare 
Sequenz zu binden, wShrend unter diesen Bedingungen unspezifische Bindungen zwischen 
nicht-komplementaren Partnem unterbleiben. Dazu soliten die Sequenzen zu 70-100%, vor- 
zugsweise zu 90-100%, komplementar sein. Die Eigenschaft komplement3rer Sequenzen, 
spezifisch aneinander binden zu kdnnen, macht man sich beispielsweise in der Northern- oder 
Southem-Blot-Technik oder bei der Primerblndung in PCR oder RT-PCR zunutze. Oblicherweise 
werden dazu Oligonukleotide ab einer LSnge von 30 Basenpaaren eingesetzL Unter stringenten 
Bedingungen versteht man beispielsweise in der Northem-Blot-Technik die Verwendung einer 
50 - 70 °C, vorzugsweise 60 - 65 °C warmen Waschlfisung, beispielsweise 0,1 x SSC-Puffer mit 
0,1% SDS (20x SSC: 3M NaClr0,3M Na-Citrat, pH 7,0) zur Elution unspezifisch hybridisierter 
cDNA-Sonden oder Oligonukleotide. Dabei bleiben, wie oben erwShnt, nur in hohem Mafce 
komplementare Nukleinsiuren aneinander gebunden. Die Einstellung stringenter Bedingungen 
ist dem Fachmann bekannt und ist z:B. in Ausubel et al., Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. beschrieben. 

c) Isolieajna der kodierenden metH-Gene 

DiefQrdas Enzym Methionin-Synthase(EC 2.1.1.13) kodierenden metH Gene aus den Organis- 
men obiger Uste I sind in an sich bekannter Weise isolierbar. 

Zur Isolierung der metH-Gene oder auch anderer Gene der Organismen aus obiger Liste I wird 
zunSchst eine Genbank dieses Organsimus in Escherichia coli (E. coli) angelegt Das Anlegen 
von Genbanken ist in allgemein bekannten LehrbOchem und HandbOchem ausfOhriich beschrie- 
ben. Als Beispiel seien das Lehrbuch von Winnacker Gene und Klone, Eine EinfQhrung in die 
Gentechnologie (Verlag Chemie, Weinheim, Deutschland, 1990), oder das Handbuch von 
Sambrook et al.: Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1 989) genannt. Eine sehr bekannte Genbank ist die des E. coli K-12 Stammes W31 10, die von 
Kohara et al. (Cell50, 495-508 (198)) in X-Vektoren angelegt wurde. 
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Zur Herstellung einer Genbank von Organismen der Liste I in E. coli k6nnen Cosmide, wie der 
CosmidvektorSuperCos I (Wahl et al., 1987 ( Proceedings of the National Academy of Sciences 
USA, 84: 2160-2164), aber auch Plasmide, wie pBR322 (BoliVal; Life Sciences, 25, 807-818 
(1979)) oder pUC9 (Vieira etal., 1982, Gene, 19: 259-268), verwendetwerden. Als Wirteeignen 
5 sich besonders solche E. coli Stamme, die restriktions- und rekombinationsdefekt sind. Ein Bei- 
spiel hierfOr ist der Stamm DH5amcr, der von Grant et al. (Proceedings of the National Academy 
of Sciences USA, 87 (1 990) 4645-4649) beschrieben wurde. Die mit Hilfe von Cosmiden klonier- 
ten langen DNA-Fragmente k6nnen anschliedend wiedeaim in gSngige.fOrdie Sequenzierung 
geeignete Vektoren subkloniert und anschlieliend sequenziert werden, so wie es z. B. bei San- 
1 0 ger et al. (proceedings of the National Academy of Sciences of the United States of America, 74: 
5463-5467, 1 977) beschrieben ist 

Die erhaltehen DNA-Sequenzen kflnnen dann mit bekannten Algorithmen bzw. Sequenzanalyse- 
Programmen, wie z. B. dem von Staden (Nucleic Acids Research 14,217-232(1986)), dem von 
15 Marck (Nucleic Acids Research 16, 1829-1836 (1988)) oder dem GCG-Programm von Butler 
(Methods ofBiochemical Analysis 39, 74-97 (1998)), untersucht werden. 

Die fur die metH-Gene kodierenden DNA-Sequenzen von Organismen gemSB obiger Liste I 
wurden gefunden. Insbesondere wurden DNA-Sequenzen gemSB gemaB SEQ ID NO:1 f 3, 5, 7, 
20 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 und 51 gefunden. 
Weiterhin wurde aus diesen vorliegenden DNA-Sequenzen mit den oben beschriebenen Metho- 
den die AminosSuresequenzen der entsprechenden Proteine abgeleitet Durch SEQ ID NO:2, 4, 
6, 8, 1 0, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 und 52 sind 
die sich ergebenden AminosSuresequenzen der metH Genprodukte dargestellt 

25 

Kodierende DNA-Sequenzen, die sich aus den Sequenzen gemaB SEQ ID NO:1, 3, 5 f 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 und 51 durch die Dege- 
neration des genetischen Kodes ergeben, sind ebenfalls Gegenstand der Erfindung. In gleicher 
Weise sind DNA-Sequenzen, die mit diesen Sequenzen oder davon abgeleiteten Sequenzteilen 
30 hybridisieren, Gegenstand der Erfindung. 

Anleitungen zur Identifizierung von DNA-Sequenzen mittels Hybridisierung findet der Fachmann 
unter anderem im Handbuch The DIG System Users Guide fOr Filter Hybridization" der Firma 
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Boehringer Mannheim GmbH (Mannheim, Deutschland, 1993) und bei Liebl etal. (International 
Journal of Systematic Bacteriology (1991 ) 41 : 255-260). Anleitungen zur Amplication von DNA- 
Sequenzen mit Hilfe der Polymerase-Kettenreaktion (PCR)findetder Fachmann unteranderem 
im Handbuch von Gait: Oligonukleotide synthesis: A Practical Approach (IRL Press, Ox- ford, 
UK, 1984) und bei Newton und Graham: PCR (Spektrum Akademischer Veriag, Heidelberg, 
Deutschland, 1994). 

Weiterhin ist bekannt, dass Anderungen am N- und/oder C- Terminus eines Proteins dessen 
Funktion nichtwesentlich beeintrachtigenodersogarstabilisienen k6nnen. Angaben hierzufindet 
der Fachmann unter anderem bei Ben-Bassat et al. (Journal of Bacteriology 169: 751-757 
(1 987)), bei O'Regan et al. (Gene 77: 237-251 (1 989), bei Sahin-Toth et al. (Protein Sciences 3: 
240-247 (1994)), bei Hochuli et al. (Biontechnology 6: 1321-1325 (1988)) und in bekannten 
Lehrbuchem der Genetik und Molekularbiologie. 

Aminosauresequenzen, die sich in entsprechender Weise aus den SEQ ID NO:2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22. 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 und 52 ergeben, sind 
ebenfalls Bestandteil der Erfindung. 

d) ErfindunosaemaB venvendete Wirtszellen 

Weitere GegenstSnde der Erfindung betreffen als Wirtszelle dienende Mikroorgansismen, insbe- 
sondere coryneforme Bakterien, die einen Vektor, insbesondere Pendelvektor Oder Plasmidvek- 
tor, der wenigstens ein metH Gen gerfindungsgemaBer Definition trSgt, enthalten Oder in denen 
ein erfindungsgemSBes metH Gen exprimiert bzw. verstSrkt ist. 

Diese Mikroorganismen k6nnen schwefelhalfige Feinchemikalien, insbesondere L-Methionin, 
aus Glucose, Saccharose, Lactose, Fructose, Maltose, Melasse, Starke, Cellulose oder aus Gly- 
cerin und Ethanol herstellen. Vorzugsweise sind dies coryneforme Bakterien, insbesondere der 
Gattung Corynebacterium. Aus der Gattung Corynebacterium ist insbesondere die Art Coryne- 
bacterium glutamicum zu nennen, die in der Fachwelt fOr ihre Fahigkeit bekannt ist, L- 
Aminosduren zu produzieren. 
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Als Beispiele fur geeignete Stamme coiyneformer Bakterien sind solche der Gattung Corynebac- 
terium, insbesondere der Art Corynebacterium glutamicum (C. glutamicum), wie 
Corynebacterium glutamicum ATCC 1 3032 f 
Corynebacterium acetoglutamicum ATCC 1 5806, 
5 Corynebacterium acetoacidophilum ATCC 1 3870, 
Corynebacterium thermoaminogenes FERM BP-1 539, 
Corynebacterium melassecola ATCC 17965 

oder 

1 0 der Gattung Brevibacterium, wie 

Brevibacterium flavum ATCC 14067 

Brevibacterium lactofermentum ATCC 1 3869 und 

Brevibacterium divaricatum ATCC 14020 zu nennen; 

Oder davon abgeleitete Stdmme, wie 
1 5 Corynebacterium glutamicum KFCC1 0065 

Corynebacterium glutamicum ATCC21608 

welche ebenfalls die gewQnschte Feinchemikalie oder deren Vorstufe(n) produzieren. 
Mit der AbkOrzung KFCC ist die Korean Federation of Culture Collection gemeint, mit der AbkOr- 
20 zung ATCC die American type strain culture collection und mit der AbkOrzung FERM die Samm- 
lung des National institute of Bioscience and Human-Technology, Agency of Industrial Science 
and Technology, Japan. 

e) DurchfOhrunq der erfindunqsQemaBen Fermentation 

25 

ErfindungsgemaS wurde festgestellt, dass coryneforme Bakterien nach Oberexpression eines 
metH Gens aus Organismen der Liste I in vorteilhafter Weise schwefelhaltige Feinchemikalien, 
insbesondere L-Methionin, produzieren. 

30 Zur Erzielung einer Oberexpression kann der Fachmann unterschiedliche MaBnahmen einzeln 
oder in Kombination ergreifen. So kann die Kopienzahl der entsprechenden Gene erh6ht wer- 
den, oder es kann die Promoter- und Regulationsregion oder die Ribosomenbindungsstelle, die 
sich stromaufwarts des Strukturgens befindet, mutiert werden. In gleicher Weise wirken Expres- 
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isionskassetten, die stromaufwSrts des Strukturgens eingebaut werden. Durch induzierbare Pro- 
motoren ist es zusStzlich mOglich, die Expression im Verlaufe der fermentativen L-Methionin- 
Produktionzu steigem. Durch MalJnahmen zurVeriSngerung der LebensdauerdermRNAwird 
ebenfalls die Expression verbessert Weiterhin wird durch Verhjndenjng des Abbaus des En- 
5 zymproteins ebenfalls die Enzymaktivitat verstarkt. Die Gene oder Genkonstrukte kennen ent- 
weder in Plasmiden mit unterschiedlicher Kopienzahl vorliegen oder im Chromosom integriert 
und amplifiziert sein. Altemativ kann weiterhin eine Oberexpression der betreffenden Gene 
durch Veranderung der Medienzusammensetzung und KulturfQhrung erreicht wertlen. 

1 0 Anleitungen hierru findet der Fachmann unter anderem bei Martin et al. (Biontechnology 5, 1 37- 
146 (1987)), bei Guerrero et al. (Gene 138, 35-41 (1994)), Tsuchiya und Morinaga 
(Bio/Technology 6, 428^30 (1 988)), bei Eikmanns et al. (Gene 1 02, 93-98 (1991 )), in der Euro- 
paischen Patentschrift 0472869, im US Patent 4,601 ,893, bei Schwarzer und PQhler (Biotechno- 
logy 9, 84-87 (1991), bei Remscheid et al. (Applied and Environmental Microbiology 60,126-132 

1 5 (1994), bei LaBarre et al. (Journal of Bacteriology 175, 1001-1007 (1993)), in der Patentanmel- 
dung WO 96/1 5246. bei Malumbres et al. (Gene 1 34, 1 5-24 (1 993)), in der japanischen Offenle- 
gungsschrift JP-A-1 0-229891, bei Jensen und Hammer (Biotechnology and Bioengineering 
58..191-195 (1 998)), bei Makrides (Microbiological Reviews 60 : 512-538 (1996) und in bekann- 
ten Lehrbuchem der Genetik und Molekularbiologie. 

20 

Gegenstand der Erfindung sind deshalb auch Expressionskonstrukte, enthaltend unter der ge- 
netischen Kontrolle regulativer NukleinsSuresequenzen eine fOr ein erfindungsgemSBes Poly- 
peptid kodierende NukleinsSuresequenz; sowie Vektoren, umfassend wenigstens eines dieser 
Expressionskonstrukte. Vorzugsweise umfassen solche erfindungsgemaBen Konstrukte 5- 

25 stromaufwarts von der jeweiligen kodierenden Sequenz einen Promotor und S'-stromabwarts 
eine Terminatorsequenz sowie gegebenenfalls weitere Obliche regulative Elemente, und zwar 
jeweils operativ verknOpfl mit der kodierenden Sequenz. Unter einer „operativen VerknQpfung* 
versteht man die sequentielle Anordnung von Promotor, kodierender Sequenz, Terminator und 
gegebenenfalls weiterer regulativer Elemente derart, dass jedes der regulativen Elemente seine 

30 Funktion bei der Expression der kodierenden Sequenz bestimmungsgemali erfQIIen kann. Bei- 
spiele fOr operativ verknOpfbare Sequenzen sind Aktivrieungssequenzen sowie Enhancer und 
dergleichen. Weitere regulative Elemente umfassen selekfierbare Marker, Amplifikationssignale, 
ReplikationsursprOnge und dergleichen. Geeignete regulatorische Sequenzen sind z.B. be- 
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schrieben in Goeddel, Gene Expression technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1 990). 

ZusStzlich zu den artifiziellen Regulationssequenzen kann die natQrliche Regulationssequenz 
vor dem eigentlichen Strukturgen noch vorhanden sein. Durch genetische VerSnderung kann 
diese natQrliche Regulation gegebenenfalls ausgeschaltet und die Expression der Gene ertiflht 
Oder emiedrigt werden. Das Genkonstrukt kann aber auch einfacher aufgebaut sein, das heiBt 
es werden keine zusatzlichen Regulationssignale vor das Strukturgen insertiert und der natOrii- 
che Promoter mit seiner Regulation wird nicht entfemt Statt dessen wird die natQrliche Regulati- 
onssequenz so mutiert, dass keine Regulation mehr erfolgt und die Genexpression gesteigert 
Oder verringert wird. Die NukleinsSuresequenzen kdnnen in einer oder mehreren Kopien im 
Genkonstrukt enthalten sein. 

Beispiele fOr brauchbare Promotoren sind: die Promotoren, ddh, amy, lysC, dapA, lysA aus Co- 
rynebacterium glutamicum, aber auch gram-positiven Promotoren SP02 wie sie in Bacillus Sub- 
tilis and Its Closest Relatives, Sonenshein, Abraham L,Hoch, James A., Losick, Richard; ASM 
Press, District of Columbia, Washington und Patek M. Eikmanns BJ. Patek J. Sahm H. Microbio- 
logy. 1 42 1 297-309, 1 996 beschrieben sind, oder aber auch cos-, tac-, trp-, tet-, trp-tet-, lpp- f lac- 
, Ipp-lac-, iaclq-, T7-, T5-, T3-, gal-, trc-, ara-, SP6-, !ambda-PR- oder lambda-PL-Promotor, die 
vorteilhafterweise in gram-negativen Bakterien Anwendung finden. Bevorzugt ist auch die Ver- 
wendung induzierbarer Promotoren, wie z.B. licht- und insbesondere temperaturinduztierbarer 
Promotoren, wie der PrP r Promotor. Prinzipiell kdnnen alle natOrlichen Promotoren mit ihren Re- 
gulationssequenzen verwendet werden. DarOber hinaus kdnnen auch synthetische Promotoren 
vorteilhaft verwendet werden. 

Die genannten regulatorischen Sequenzen sollen die gezielte Expression der Nukleinsfiurese- 
quenzen ermdglichen. Dies kann beispielsweise je nach Wirtsorganismus bedeuten, dass das 
Gen erst nach Induktion exprimiert oder Oberexprimiert wird, oder dass es sofort exprimiert 
und/oder Oberexprimiert wird. 

Die regulatorischen Sequenzen bzw. Faktoren kdnnen dabei vorzugsweise die Expression posl- 
tiv beeinflussen und dadurch erh6hen oder emiedrigen. So kann eine VerstSrkung der regulato- 
rischen Elemente vorteilhatterweise auf der Transkriptionsebene erfolgen, indem starke 
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Transkriptionssignale wie Promotoren und/oder "Enhancer* verwendetwercJen. Daneben istaber 
auch eine VerstSrkung der Translation mSglich, indem beispielsweise die Stabilitat der mRNA 
verbessert wird. 

5 Die Herstellung einer Expressionskassette erfolgt durch Fusion eines geeigneten Promoters, 
einer geeigneten Shine-Dalgamo-Sequenz mit einer rnetH-Nukleotidsequenz sowie einem ge- 
eigneten Terminationssignal. Dazu verwendet man gangige Rekombinations- und Klonie- 
rungstechniken, wie sie beispielsweise in Current Protocols in Molecular Biology, 1993, John 
Wiley & Sons, Incorporated, New York New York, PCR Methods, Geffand, David H M Innis, Mi- 

1 0 chael A., Sninsky, John J. 1 999, Academic Press, Incorporated, California, San Diego, ., PCR 
Cloning Protocols, Methods in Molecular Biology Ser. f Vol. 1 92, 2nd ed„ Humana Press, New 
Jersey, Totowa. T. Maniatis, E.F. Fritsch und J. Sambrook, Molecular Cloning: A Laboratory Ma- 
nual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1 989) sowie in T. J. Silhavy, M.L 
Berman und L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, 

1 5 Cold Spring Harbor, NY (1 984) und in Ausubel, F.M. et al., Current Protocols in Molecular Biolo- 
gy, Greene Publishing Assoc. and Wiley Interscience (1 987) beschrieben sind. 

Das rekombinante Nukleinsdurekonstrukt bzw. Genkonstrukt wird zur Expression in einem ge- 
eigneten Wirtsorganismus vorteilhafterweise in einen wirtsspezifischen Vektor insertiert, der eine 

20 optimale Expression der Gene im Wirt ermfiglicht. Vektoren sind dem Fachmann wohl bekannt 
und kflnnen beispielsweise aus "Cloning Vectors" (Pouwels P. H. et al. ( Hrsg, Elsevier, Amster- 
dam-New York-Oxford, 1 985) entnommen werden. Unter Vektoren sind auBer Plasmiden auch 
alle anderen dem Fachmann bekannten Vektoren, wie beispielsweise Phagen, Transposons, IS- 
Elemente, Phasmide, Cosmide, und lineare oder zirkulare DNA zu verstehen. Diese Vektoren 

25 k6nnen autonom im Wirtsorganismus repliziert oder chromosomal repliziert werden. 

Zur Versterkung wurden erfindungsgemaile metH Gene beispielhaft mit Hilfe von episomalen 
Plasmiden flberexprimiert. Als Plasmide eignen sich solche, die in corynefonmen Bakterien repli- 
ziert werden. Zahlreiche bekannte Plasmidvektoren, wie z. B. pZ1 (Menkel et al., Applied and 
30 Environmental Microbiology (1989) 64: 549-554), pEKExl (Eikmanns et al.. Gene 102: 93-98 
(1991)) oder pHS2-1 (Sonnen et at, Gene 107: 69-74 (1991)) beruhen auf den kryptischen 
Plasmiden pHM1519, pBL1 oder pGA1. Andere Plasmidvektoren, wie z. B. pCUK5MCS, oder 
solche, die auf pCG4 (US-A 4,489,160) oder pNG2 (Serwold-Davis et al., FEMS Microbiology 
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Letters 66, 119-124 (1990)) Oder pAG1 (US-A 5,158,891) bewhen, kGnnen in gleicher Weise 
veiwendetwerden. 

Weiterhin eignen sich auch solche Plasmidvektoren mit Hilfe derer man das Verfahren der Ge- 
namplifikation durch Integration in das Chromosom anwenden kann, so wie es beispielsweise 
von Remscheid etal. (Applied and Environmental Microbiology 60,126-132 (1994))zurDuplika« 
tion bzw. Amplifikation des hom-thrB-Operons beschrieben wurde. Bel dleser Methode wird das 
vollstendige Gen in einen Plasmidvektor kloniert, der in einem Wirt (typischerweise E. coli), nicht 
aber in C. glutamicum replizieren kann. Als Vektoren kommen beispielsweise pSUP301 (Simon 
et aL, Bio/ Technology 1,784-791 (1983)), pK18mob oder pK19mob (Schafer et al. f Gene 
145,69-73 (1994)), Bernard et al., Journal ofMolecular Biology, 234: 534-541 (1993)), pEM1 
(Schrumpf etal. 1991, Journal of Bacteriology 173: 451(M516)oderpBGS8 (Sprattetal.,1986. 
Gene 41: 337-342) in Frage. Der Plasmidvektor, der das zu amplifizierende Gen enthait, wird 
anschlieftend durch Transformation in den gewOnschten Stamm von C. glutamicum OberfOhrt 
Methoden zur Transformation sind beispielsweise beiThierbach etal. (Applied Microbiology and 
Biotechnology 29, 356-362 (1988)), Dunican und Shivnan (Biotechnology 7, 1067-1070 (1989)) 
und Tauch et al. (FEMS Microbiological Letters 123,343-347 (1994)) beschrieben. 

Enzyme kSnnen durch Mutationen in den korrespondierenden Genen derart in ihrer Aktivitat 
beeinflulit werden, dass es zu einer teilweisen oder vollstSndigen Vemngerung der Reaktions- 
geschwindigkeit der enzymatischen Reaktion kommL Beispiele fOr solche Mutationen sind dem 
Fachmann bekannt (Motoyama H. Yano H. Terasaki Y. Anazawa H. Applied & Environmental 
Microbiology. 67:3064-70, 2001 , Eikmanns BJ. Eggeling L. Sahm H. Antonie van Leeuwenhoek. 
64:145-63, 1993-94.) 

Zusatzlich kann es fOr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, neben einer Expression bzw. VerstSrkung eines erfindungsgemaften 
metH-Gen eines oder mehrere Enzyme des Methionin-Biosyntheseweges oder eines damit as- 
soziierten (d.h. in einem funktionelleZusammenhangstehenden)Biosynthese- oder sonstigen 
Stoffwechselweges, wie des Cystein-, Lysin- oder Threonin-Stoffwechselwegs, wie insbesondere 
der Aspratatsemialdehyd-Synthese, der Glykolyse, der Anaplerotik, des Pentose-Phosphat- 
Stoffwechsels, des ZitronensSure-Zyklus oder des AminosSure-Exports zu verstSrken. 
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So kann fOr die Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, 
eines oder mehrere der folgenden Gene verstarkt sein, (d.h. z.B. in einer hOheren Kopienzahl 
vorliegen oder ein Enzym mit hdherer Aktivitat Oder Spezifitat kodieren): 
• das fQr eine Aspartatkinase kodierende Gen lysC (EP 1 108 790 A2; DNA-SEQ NO. 281), 
5 -das fOr eine Aspartat-Semialdehyd Dehydrogenasekodierende Gen asd (EP 1 108 790 A2; 
DNA-SEQ NO. 282), 

- das fOr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierende Gen gap (Eikmanns 
(1992), Journal of Bacteriology 174: 6076-6086), 

- das fOr die 3-Phosphoglycerat Kinase kodierende Gen pgk (Eikmanns (1 992), Journal of Bade- 
10 riology 174: 6076-6086), 

- das fOrdie Pyruvat Carboxylase kodierende Gen pyc (Eikmanns (1992), Journal of Bacteriology 
174:6076-6086), 

- das fOr die Triosephosphat Isomerase kodierende Gen tpi (Eikmanns (1 992), Journal of Bacte- 
riology 174: 6076-6086), 

15 - das fOr die Homoserin O-Acetyltransferase kodierende Gen metA (EP 1 1 08 790 A2; DNA-SEQ 
NO. 725), 

- das fOr die Cystahionin-gamma-Synthase kodierende Gen metB (EP 1 1 08 790 A2; DNA-SEQ 
NO. 3491), 

- das fOr die Cystahionin-gamma-Lyase kodierende Gen metC (EP 1 108 790 A2; DNA-SEQ NO. 
20 3061), 

- das fOr die Serin-Hydroxymethyltransferase kodierende Gen glyA (EP 1 1 08 790 A2; DNA-SEQ 
NO. 1110), 

- das fur die O-Acetylhomoserin-Sulfhydrylase kodierende Gen metY (EP 1 108 790 A2; DNA- 
SEQ NO, 726), 

25 - das fOr die Methylentetrahydrofolat-Reduktase kodierende Gen metF (EP 1 1 08 790 A2; DNA- 
SEQ NO. 2379), 

- das fQr die Phosphoserin-Aminotransferase kodierende Gen serC (EP 1 108 790 A2; DNA- 
SEQ NO. 928) 

- eines fOr die Phosphoserin-Phosphatase kodierende Gen serB (EP 1 108 790 A2; DNA-SEQ 
30 NO. 334, DNA-SEQ NO. 467, DNA-SEQ NO. 2767) 

- das fOr die Serine Acetyl-Transferase kodierende Gen cysE (EP 1 1 08 790 A2; DNA-SEQ NO. 
2818) 



WO 03/087386 PCT/EP03/04010 



24 

- das fQr eine Homoserin-Dehydrogenase kodierende Gen horn (EP 1 108 790 A2; DNA-SEQ . 
NO. 1306) 

So kann fOrdie Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, in 
5 coryneformen Bakterien, vorteilhafl sein, gleichzeitig wenigstens eines der nachfolgenden Gene 
zu mutieren, insbesondere so, dass die korrespondierenden Proteine, verglichen mit nicht mu- 
tierten Proteinen, in geringerem Mafie oder nicht durch einen Stoflwechselmetaboliten in ihrer 
Aktivitat beeinflusst werden: 

10 - das fOr eine Aspartatkinase kodierende Gen lysC (EP 1 1 08 790 A2; DNA-SEQ NO. 281 ), 

- das fOr die Pyruvat Carboxylase kodierende Gen pyc (Eikmanns (1 992), Journal of Bacteriology 
174:6076-6086), 

- das fur die Homoserin O-Acetyltransferase kodierende Gen metA (EP 1 108 790 A2; DNA-SEQ 
NO. 725), 

15 - das fOr die Cystahionin-gamma-Synthase kodierende Gen metB (EP 1 1 08 790 A2; DNA-SEQ 
NO. 3491), 

- das fOr die Cystahionin-gamma-Lyase kodierende Gen metC (EP 1 108 790 A2; DNA-SEQ NO. 
3061), 

- das fOr die Serin-Hydroxymethyltransferase kodierende Gen gtyA (EP 1 1 08 790 A2; DNA-SEQ 
20 NO. 1110), 

- das fOr die O-Acetylhomoserin-Sulfhydrylase kodierende Gen metY (EP 1 108 790 A2; DNA- 
SEQ NO. 726), 

- das fOr die Methylentetrahydrofolat-Reduktase kodierende Gen metF (EP 1 108 790 A2; DNA- 
SEQ NO. 2379), 

25 - das fur die Phosphoserin-Aminotransferase kodierende Gen serC (EP 1 108 790 A2; DNA- 
SEQ NO. 928) 

- eines fur die Phosphoserin-Phosphatase kodierende Gen serB (EP 1 1 08 790 A2; DNA-SEQ 
NO. 334, DNA-SEQ NO. 467, DNA-SEQ NO. 2767) 

- das fOr die Serine Acetyl-Transferase kodierende Gen cysE (EP 1 1 08 790 A2; DNA-SEQ NO. 
30 2818) 

- das fOr eine Homoserin-Dehydrogenase kodierende Gen horn (EP 1 108 790 A2; DNA-SEQ 
NO. 1306) 
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Weiterhin kann es fQr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, zusatzlich zur Expression bzw. VerstSrkurig eines der erfindungsge- 
rnallen metH-Gene eines oder mehrere der folgenden Gene abzuschwSchen, insbesondere 
deren Expression zu verringem, oder ayszuschalten: 

5 

• das fQr die Homoserine-Kinase kodierende Gen thrB (EP 1 108 790 A2; DNA-SEQ NO. 3453) 

- das fOr die Threonin Dehydratase kodierende Gen ilvA (EP 1 108 790 A2; DNA-SEQ NO, 
2328) 

- das fOr die Threonin Synthase kodierende Gen thrC (EP 1 108 790 A2; DNA-SEQ NO. 3486) 
10 - das fOr die Meso-Diaminopimelat D-Dehydrogenase kodierende Gen ddh (EP 1 1 08 790 A2; 

DNA-SEQ NO. 3494) 

- das fur die Phosphoenolpyruvat-Carboxykinase kodierende Gen pck (EP 1 1 08 790 A2; DNA- 
SEQ NO. 3157) 

- das fOr die Glucose-6-Phosphat-6-lsomerase kodierende Gen pgi (EP 1 108 790 A2; DNA- 
15 SEQNO.950) 

- das fOr die Pyruvat-Oxidase kodierende Gen poxB (EP 1 108 790 A2; DNA-SEQ NO. 2873) 

- das fOr die Dihydrodipicolinat Synthase kodiemde Gen dapA(EP 1 108 790 A2; DNA-SEQ NO. 
3476) 

- das fQr die Dihydrodipicolinat Reduktase kodiemde Gen dapB (EP 1 108 790 A2; DNA-SEQ 
20 NO. 3477) 

- das fQr die Diaminopicolinat Decarboxylase kodiemde Gen lysA (EP 1 1 08 790 A2; DNA-SEQ 
NO. 3451) 

Weiterhin kann es fOr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
25 Methionin, vorteilhaft sein, zusatzlich zur Expression bzw. VerstSrkung eines der erfindungsge- 
mSBen metH-Gene in Coryneformen Bakterien gleichzeitig wenigstens eines der folgenden Ge- 
ne so zu mutieren, dass die enzymatische AktivitSt des korrespondierenden Proteins teilweise 
oder vollst§ndig verringert wird: 

30 - das fur die Homoserine-Kinase kodierende Gen thrB (EP 1 1 08 790 A2; DNA-SEQ NO. 3453) 

- das fOr die Threonin Dehydratase kodierende Gen ilvA (EP 1 108 790 A2; DNA-SEQ NO. 
2328) 

- das fOr die Threonin Synthase kodierende Gen thrC (EP 1 108 790 A2; DNA-SEQ NO. 3486) 
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- das fur die Meso-Diaminopimelat D-Dehydrogenase kodierende Gen ddh (EP 1 108 790 A2; , 
DNA-SEQ NO. 3494) 

- das fOr die Phosphoenolpyaivat-Carboxykinase kodierende Gen pck (EP 1 1 08 790 A2; DNA- 
SEQ NO. 3157) 

5 - das fOr die Glucose-6-Phosphat-6-lsomerase kodierende Gen pgi (EP 1 108 790 A2; DNA- 
SEQ NO. 950) 

- das fOr die Pyruvat-Oxidase kodierende Gen poxB (EP 1 108 790 A2; DNA-SEQ NO. 2873) 

- das fOr die Dihydrodipicolinat Synthase kodiemde Gen dapA(EP 1 1 08 790 A2; DNA-SEQ NO. 
3476) 

10 - das fOr die Dihydrodipicolinat Reduktase kodiemde Gen dapB (EP 1 1 08 790 A2; DNA-SEQ 
NO. 3477) 

- das fOr die Diaminopicolinat Decarboxylase kodiemde Gen lysA (EP 1 108 790 A2; DNA-SEQ 
NO. 3451) 

15 Weiterhin kann es fOr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, neben der Expression bzw. Verstarfcung eines erfindungsgemfi&en 
metH-Gens unerwOnschte Nebenreaktionen auszuschalten, welche beispielsweise die Ausbeute 
an der Feichchemikalie verringem (Nakayama: "Breeding of Amino Acid Producing Microorga- 
nisms", in: Overproduction of Microbial Products, Krumphanzl, Sikyta, Vanek (eds.), Academic 

20 Press, London, UK, 1 982). 

Die erfindungsgemad hergestellten Mikroorganismen kflnnen kontinuieriich oderdiskontinuler- 
lich im batch- Verfahren (Satzkultivierung) oder im fed batch (Zulaufverfahren) oder repeated fed 
batch Verfahren (repetitives Zulaufverfahren) zur Produktion von schwefelhaltige Feinchemika- 
25 lien, insbesondere L-Methionin, kultiviert werden. Eine Zusammenfassung Ober bekannte Kulti- 
vierungsmethoden ist im Lehrbuch von Chmiel (BioprozelJtechnik 1 . EinfOhrung in die Bioverfah- 
renstechnik (Gustav Fischer Vertag, Stuttgart, 1991 )) oder im Lehrbuch von Storhas (Bioreakto- 
ren und periphere Einrichtungen (Vieweg Verlag, Braunschweig/Wiesbaden, 1994))zufinden. 

30 Das zu verwendende Kulturmedium hat in geeigneter Weise den AnsprOchen der jeweiligen 
StSmme zu genugen. Beschreibungen von Kulturmedien verschiedener Mikroorganismen sind 
im Handbuch "Manual of Methods fOr General Bacteriology" der American Society fOr Bacterio- 
logy (Washington D. C, USA, 1981) enthalten. 
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Diese erfindungsgemSfi einsetzbaren Medien umfassen gewShnlich eine oder mehrere Koh- 
lenstoffquellen, Stickstoffquellen, anorganische Salze, Vitamine und/oder Spurenelemente. 

5 Bevorzugte Kohlenstoffquellen sind Zucker, wie Mono-, Dh oder Polysaccharide. Sehr gute Koh- 
lenstoffquellen sind beispielsweise Glucose, Fructose, Mannose, Galactose, Ribose, Sorbose, 
Ribulose, Lactose, Maltose, Saccharose, Raffinose, Starke oder Cellulose. Man kann Zucker 
auch Ober komplexe Verbindungen, wie Melassen, oder andere Nebenprodukte der Zucker- 
Raffinierung zu den Medien geben. Es kann auch vorteilhaftsein, Gemische verschiedener Koh- 
10 lenstoffquellen zuzugeben. Andere mogliche Kohlenstoffquellen sind Ole und Fette wie z. B. 
Sojafll. Sonnenblumen6l. EndnulJGI und Kokosfett, FettsSuren wie z. B. Palmitinsaure, Stearin- 
sSure oder LinolsSure, Alkohole wie z. B. Glycerin, Methanol oder Ethanol und organische SSu- 
ren wie z. B. EssigsSure oder MilchsSure. 

1 5 Stickstoffquellen sind gewflhnlich organische oder anorganische Stickstoffverbindungen oder 
Materialien, die diese Verbindungen enthalten. Beispielhafte Stickstoffquellen umfassen Ammo- 
niak-Gas oder Ammoniumsalze, wie Ammoniumsulfat, Ammoniumchlorid, Ammoniumphosphat, 
Ammoniumcarbonat oder Ammoniumnitrat, Nitrate, Hamstoff, AminosSuren oder komplexe 
Stickstoffquellen, wie Maisquellwasser, Sojamehl, Sojaprotein, Hefeextrakt, Fleischextrakt und 

20 andere. Die Stickstoffquellen k6nnen einzeln oder als Mischung verwendet werden. 

Anorganische Salzverbindungen, die in den Medien enthalten sein kflnnen, umfassen die Chlo- 
rid-, Phosphor- oder Sulfatsalze von Calcium, Magnesium, Natrium, Kobalt, MolybdSn, Kalium, 
Mangan, Zink, Kupfer und Eisen 
25 . 

Als Schwefelquelle fOr die Herstellung von schwefelhaltigen Feinchemikalien, insbesondere von 
Methionin, kGnnen anorganische schwefelhaltige Verbindungen wie beispielsweise Sulfate, Sulfi- 
te, Dithionite, Tetrathionate, Thiosulfate, Sulfide aber auch organische Schwefelverbindungen, 
wie Mercaptane und Thiole, verwendet werden. 

30 

Als Phosphorquelle konnen PhosphorsSure, Kaliumdihydrogenphosphat oder Dikaliumhydro- 
genphosphat oder die entsprechenden Natrium haltigen Salze verwendet werden. 
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Chelatbildner kfinnen zum Medium gegeben werden, urn die Metallionen in Ldsung zu halten. 
Besonders geeignete Chelatbildner umfassen Oihydroxyphenole, wie Catechol Oder Protocate- 
chuat, oder organische SSuren, wie Citronensflure. 

5 Die erfindungsgemSIJ eingesetzten Fermentationsmedien enthalten Qblicherweise auch andere 
Wachstumsfaktoren, wie Vitamine oder Wachstumsfdrderer, zu denen beispielsweise Biotin, 
Riboflavin, Thiamin, FolsSure, NikotinsSure, Panthothenat und Pyridoxin gehOren, Wachstums- 
faktoren und Salze stammen haufig von komplexen Medienkomponenten, wie Hefeextrakt, Me- 
lassen, Maisquellwasser und dergleichen. Dem Kulturmedium kfinnen Oberdies geeignete Vor- 

1 0 stufen zugesetzt werden. Die genaue Zusammensetzung der Medienverbindungen hSngt stark 
vom jeweiligen Experiment ab und wird fOr jeden spezifischen Fall individuell entschieden. In- 
formation uber die Medienoptimierung ist erhaitlich aus dem Lehrbuch "Applied Microbiol Physi- 
ology, A Practical Approach" (Hrsg. P.M. Rhodes, P.F. Stanbury, IRL Press (1997) S. 53-73, 
ISBN 0 19 963577 3). Wachstumsmedien lassen sich auch von kommerziellen Anbietem bezie- 

15 hen, wie Standard 1 (Merck) oder BHI (Brain heart infusion, DIFCO) und dergleichen. 

SSmtliche Medienkomponenten werden, entwederdurch Hitze (20 min bei 1,5 bar und 121 P C) 
oder durch Sterilfiltration, sterilisiert Die Komponenten kannen entweder zusammen odern&ti- 
genfalls getrennt sterilisiert werden. SSmtliche Medienkomponenten k6nnen zu Beginn der An- 
20 zucht zugegen sein oder wahlfrei kontinuierlich oder chargenweise hlnzugegeben werden. 

Die Temperatur der Kultur liegt normalerweise zwischen 1 5°C und 45 W C, vorzugsweise bei 25°C 
bis 40°C und kann wahrend des Experimentes konstant gehalten oder verandert werden. Der 
pH-Wert des Mediums sollte im Bereich von 5 bis 8,5, vorzugsweise urn 7,0 liegen. Der pH-Wert 

25 fOr die Anzucht IdBt sich wflhrend der Anzucht durch Zugabe von basische Verbindungen wie 
Natriumhydroxid, Kaliumhydroxid, Ammoniak bzw. Ammoniakwasser oder saure Verbindungen 
wie Phosphorsflure oder SchwefelsSure kontrollieren. Zur Kontrolle der Schaumentwicklung 
kdnnen Antischaummitte.l wie z. B. Fetts3urepolyglykolester, eingesetzt werden. Zur Aufrechter- 
haltung der Stability von Plasmiden konnen dem Medium geeignete selektivwirkende Stoffe, 

30 wie z. B. Antibiotika, hinzugefOgt werden. Urn aerobe Bedingungen aufrechtzuerhalten, werden 
Sauerstoff oderSauerstoff haltige Gasmischungen, wiez. B. Umgebungsluft, indie Kultur einge- 
tragen. Die Temperatur der Kultur liegt normalerweise bei 20°C bis 45°C. Die Kultur wird solange 
fortgesetzt, bis sich ein Maximum des gewunschten Produktes gebildet hat. Dieses Ziel wircl 
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normalerweise innerhaib von 10 Stunden bis 160 Stunden erretcht 

Die so erhaltenen, insbesondere L-Methionin enthaltenden, FermentationsbrOhen haben Obll- 
cherweise eine Trockenmasse von 7,5 bis 25 Gew.-%. 

5 

Vorteilhafl ist aulierdem auch, wenn die Fermentation zumindest am Ende, insbesondere jedoch 
Ober mindestens 30% der Fermentationsdauer zuckerlimitiert gefahren wird. Das heiBt, dass 
wShrend dieser Zeit die Konzentration an verwertbarem Zucker im Fermentationsmedium auf £ 0 
bis 3 g/l gehaiten. beziehungsweise abgesenkt wird. 

Die FermentationsbrOhe wird anschlieBend weiterverarbeitet. Je nach Anforderung kann die 
Biomasse ganz Oder teilweise durch Separationsmethoden, wie z. B. Zentrifugation, Filtration, 
Dekantieren oder einer Kombination dieser Methoden aus der FermentationsbrOhe entfemtoder 
vollstfindig in ihr belassen werden. 

15 

AnschlieBend kann die FermentationsbrOhe mit bekannten Methoden, wie z. B. mit Hilfe eines 
Rotationsverdampfers, DOnnschichtverdampfers, Fallfilmverdampfers, durch Umkehrosmose, 
Oder durch Nanofiltration, eingedickt beziehungsweise aufkonzentriert werden. Diese aufkon- 
zentrierte FermentationsbrOhe kann anschlieBend durch Gefriertrocknung, SprOhtrocknung, 
20 Spruhgranulation oder durch anderweitige Verfahren aufgearbeitet werden. 

Es ist aber auch maglich die schwefelhaltigen Feinchemikalien, insbesonder L-Methionin, weiter 
aufeureinigen. Hierzu wird die produkthaltige BrOhe nach dem Abtrennen der Biomasse einer 
Chromatographie mit einem geeigneten Harz unterworfen, wobei das gewOnschte Produkt oder 

25 die Verunreinigungen ganz oder teilweise auf dem Chromatographieharz zurfickgehalten wer- 
den. Diese Chromatographieschritte kfinnen nStigenfalls wiedertiolt werden, wobei die gleichen 
oder andere Chromatographieharze verwendet werden. Der Fachmann ist in der Auswahl der 
geeigneten Chromatographieharze und ihrer wirksamsten Anwendung bewandert Das gereinig- 
te Produkt kann durch Filtration oder Ultrafiltration konzentriert und bei einer Temperaturaufbe- 

30 wahrt werden, bei der die Stabilitat des Produktes maximal ist 

Die Identitat und Reinheit der isolierten Verbindung(en) kann durch Techniken des Standes der 
Technik besfimmt werden. Diese umfassen Hochleistungs-FIOssigkeitschromatographie (HPLC), 
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spektroskopische Verfahren, FSrbeverfahren, DOnnschichtchrornatographie, NIRS, Enzymtest 
Oder mikrobiologische Tests. Diese Analyseverfahren sind zusammengefafjt in: Patek et al. 
(1994) Appl. Environ. Microbiol. 60:133-140; Malakhova etal. (1996) Biotekhnologiya 11 27-32; 
und Schmidt et al. (1 998) Bioprocess Engineer. 19:67-70. Ulmann's Encyclopedia of Industrial 
5 Chemistry (1996) Bd. A27, VCH: Weinheim, S. 89-90, S. 521-540, S. 540-547, S. 559-566, 575- 
581 und S. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Mo- 
lecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of HPLC in Bioche- 
mistry in: Laboratory Techniques in Biochemistry and Molecular Biology, Bd. 17. 

1 0 Die Erfindung wird nun anhand der folgenden nicht-limitierenden Beisplele nfiher beschrieben: 

Belsplel 1: Konstruktlon von pCLiK5MCS 

ZunSchst wurden Ampicillinresistenz und Replikationsursprung des Vektors pBR322 mit den 
15 Oligonukleotiden p1.3 (SEQ ID NO:53) und p2.3 (SEQ ID NO:54) mit Hilfe der Polymerase- 
Kettenreaktion (PCR) amplifiziert 

p1.3(SEQIDNO:53) 

5 t -CCCGGGATCCGCTAGCGGCGCGCCGGCCGGCCCGGTGTGAAATACCGCACAG-3 , 

20 

p2.3 (SEQ ID NO:54) 

S'-TCTAGACTCGAGCGGCCGCGGCCGGCCTTTAAATTGAAGACGAAAGGGCCTCG-S' 

Neben den zu pBR322 komplementSren Sequenzen, enthSIt das Oligonukteotid p1.3 (SEQ ID 
25 NO:53) in 5-3' Richtung die Schnittstellen fur die Restriktionsendonukleasen Smal, BamHI, Nhel 
und AscI und das Oligonukleotid p2.3 (SEQ ID NO:54) in 5'-3' Richtung die Schnittstellen fOr 
die RestriWionsendonukleasen Xbal, Xhol, Notl und Dral. Die PCR Reaktion wurde nach Stan- 
dardmethode wie Innis etal. (PCR Protocols. A Guide to Methods and Applications, Academic 
Press (1 990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) durchgefuhrt. Das erhalte- 
30 ne DNA Fragment mit einer GrbBe von ungefiihr 2,1 kb wurde mit dem GFX™PCR, DNA and 
Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gerel- 
nigt Die stumpfen Enden des DNA-Fragmentes wurden mit dem Rapid DNA Ligation Kit (Roche 
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Diagnostics, Mannheim) nach Angaben des Herstellers miteinander ligiert und der Ligationsan- 
satz nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laboratory Manual, 
Cold Spring Harbor, beschrieben(1 989)), in kompetente E.coli XL-1 Blue (Stratagene, La Jolla, 
USA) transformiert. Eine Selektion auf Plasmid tragende Zellen wurde durch das Ausplattieren 
5 auf Ampicillin (50pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreicht 

Die Plasmld-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Qber Restriktionsverdaus Oberprtft Das so 
ertialtene Plasmid erhSIt den Namen pCLiKI . 

10 

Ausgehend vom Plasmid pWLT1 (Liebl et al., 1 992) als Template fOr eine PCR Reaktion wurde 
mit den Oligonukleotiden neol (SEQ ID NO:55) und neo2 (SEQ ID NO:56) eine Kanamycin- 
Resistenzcassette amplifiziert 

15 neol (SEQ ID NO:55): 

S-GAGATCTAGACCCGGGGATCCGCTAGCGGGCTGCTAAAGGAAGCGGA-S* 

neo2 (SEQ ID NO:56): 

5'-GAGAGGCGCGCCGCTAGCGTGGGCGAAGAACTCCAGCA-3' 

20 

Neben den zu pWLT1 komplementSren Sequenzen, enthait das Oligonukleotid neol in 5-3' 
Richtung die Schnittstellen fOr die Restriktionsendonukleasen Xbal, Smal, BamHI, Nhel und das 
Oligonukleotid neo2 (SEQ ID NO:56) in V-V Richtung die Schnittstellen fQr die Restriktionsen- 
donukleasen AscI und Nhel. Die PCR Reaktion wurde nach Standardmethode wie Innis et al. 

25 (PCR Protocols. A Guide to Methods and Applications, Academic Press (1 990)) mit PfuTurbo 
Polymerase (Stratagene, La Jolla, USA) durchgefuhrt. Das erhaltene DNA Fragment mit einer 
Grolie von ungefShr 1 ,3 kb wurde mit dem GFX™PCR, DNA and Gel Band Purification Kit (A- 
mersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt. Das DNA-Fragment 
wurde mit den Restriktionsendonukleasen Xbal und AscI (New England Biolabs, Beverly, USA) 

30 geschnitten und im Anschluli daran emeut mit dem GFX™PCR, DNA and Gel Band Purification 
Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt Der Vektor 
pCLiKI wurde ebenfalls mit den Restriktionsendonukleasen Xbal und AscI geschnitten und mit 
alkalischer Phosphatase (Roche Diagnostics, Mannheim) nach Angaben des Herstellers 
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dephosphoryliert Nach Elektrophorese in einem 0,8%igen Agarosegel wurde der linearisierte 
Vektor (ca. 2, 1 kb) mit dem GFX™PCR, DNA and Gel Band Purification Kit (Amereham Pharma- 
cia, Freiburg) nach Angaben des Herstellers isoliert Dieses Vektor-Fragment wurde mit Hilfe 
des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach Angaben des Herstellers mit 
5 dem geschnittenen PGR Fragment ligiert und der Ligationsansatz nach Standardmethoden wie 
in Sambrook et al. (Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, beschrie- 
ben(1 989)), in kompetente Ecoli XL-1 Blue (Stratagene, La Jotia, USA) transformiert Eine Selek- 
tion auf Plasmid tragende Zellen wurde durch das Ausplattieren auf Ampicillin (50pg/ml) und 
Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreicht 

10 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus OberprOft Das so 
erhaltene Plasmid erhait den Namen pCUK2. 

1 5 Der Vektor pCLiK2 wurde mit der Restriktionsendonuklease Dral (New England Biolabs, Beverly, 
USA) geschnitten. Nach Elektrophorese in einem 0,8%igen Agarosegel wurde ein ca. 2,3 kb 
groBes Vektorfragment mit dem GFX™PCR, DNA and Gel Band Purification Kit (Amersham 
Pharmacia, Freiburg) nach Angaben des Herstellers isoliert Dieses Vektor-Fragment wurde mit 
Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach Angaben des Herstellers 

20 religiert und der Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Molecular Clo- 
ning. A Laboratory Manual, Cold Spring Harbor, beschrieben (1989)), in kompetente E.coli XL- 
1Blue (Stratagene, La Jolla, USA) transformiert Eine Selektion auf Plasmid tragende Zellen 
wurde durch das Ausplattieren auf Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Viro- 
logy, 1:190) erreicht 

25 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus OberprOft. Das so 
erhaltene Plasmid erhalt den Namen pCLiK3. 

30 Ausgehend vom Plasmid pWLQ2 (Liebl et at, 1 992) als Template fQr eine PCR Reakfion wurde 
mit den Oligonukleotiden cg1 ((SEQ ID NO:57) und cg2 (SEQ ID NO:58) der Replikation- 
sursprung pHM1519 amplifiziert 
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cg1 (SEQ ID NO:57): 

S'-GAGAGGGCGGCCGCGCAAAGTCCCGCTTCGTGAA-y 
cg2(SEQIDNO:58): 

5-GAGAGGGCGGCCGCTCAAGTCGGTCAAGCCACGC-3 4 

Neben den zu pWLQ2 komplementSren Sequenzen, enthalten die Oligonukleotide cg1 (SEQ ID 
NO:57) und cg2 (SEQ ID NO:58) Schnittstellen fiir die Restriktionsendonuklease Notl. Die PCR 
Reaktion wurde nach Standardmethode wie Innis et al. (PCR Protocols. A Guide to Methods and 
Applications, Academic Press (1990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) 
durchgef QhrL Das erhaltene DNA Fragment mit einer Gr5Be von ungefahr 2,7 kb wurde mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Das DNA-Fragment wurde mit der Restriktionsendonuklease Notl 
(New England Biolabs, Beverly, USA) geschnitten und im AnschluU daran emeut mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Der Vektor pCUK3 wurde ebenfalls mit der Restriktionsendonuklease 
Notl geschnitten und mit alkalischer Phosphatase (Roche Diagnostics, Mannheim)) nach Anga- 
ben des Herstellers dephosphoryliert Nach Elektrophorese in einem 0,8%igen Agarosegel wur- 
de der linearisierte Vektor (ca. 2,3kb) mit dem GFX™PCR, DNA and Gel Band Purification Kit 
(Amersham Pharmacia, Freiburg) nach Angaben des Herstellers isoliert Dieses Vektor- 
Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers mit dem geschnittenen PCR Fragment liglert und der Ligationsansatz 
nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laboratory Manual, Cold 
Spring Harbor, beschrieben(1 989)), in kompetente E.coli XL-1 Blue (Stratagene, La Jolla, USA) 
transformiert. Eine Selektion auf Plasmid tragende Zellen wurde durch das Ausplattieren auf 
Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreicht. 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Qber Restriktionsverdaus OberprOft. Das so 
erhaltene Plasmid erhait den Namen pCLiK5. 

FQr die Erweiterung von pCLik5 urn eine .multiple cloning site" (MCS) wurden die beide synthefi- 
schen, weitestgehend komplementSren Oligonukleotide HS445 ((SEQ ID NO:59) und HS446 
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(SEQ ID NO:60), die Schnittstellen fOr die Restriktionsendonukleasen Swal, Xhol, Aatl, Apal, 
Asp718, Mlul, Ndel, Spel, EcoRV, Sail, Clal, BamHI, Xbal und Smal enthalten, durch gemein- 
sames ertiitzen auf 95°C und langsames abkuhlen zu einem doppelstrSngigen DNA-Fragment 
vereinigt 

HS445(SEQIDNO:59): 

S-TCGAATTTAAATCTCGAGAGGCCTGACGTCGGGCCCGGTACCACGCGTCATATGACTAG 

TTCGGACCTAGGGATATCGTCGACATCGATGCTCTTCTGCGTTAATTAACAATTGGGATCC 
TCTAGACCCGGGATTTAAAT-3' 

HS446 (SEQ ID NO:60): 

5-GATCATTTAAATCCCGGGTCTAGAGGATCCCMTTGTTAATTAACGCAGAAGAGCATCGA 

TGTCGACGATATCCCTAGGTCCGAACTAGTCATATGACGCGTGGTACCGGGCCCGACGTC 
AGGCCTCTCGAGATTTAAAT-3 1 

Der Vektor pCLiKS wurde mit den Restriktionsendonuklease Xhol und BamHI (New England 
Biolabs, Beverly, USA) geschnitten und mit alkalischer Phosphatase (I (Roche Diagnostics, 
Mannheim)) nach Angaben des Herstellers dephosphoryliert Nach Elektrophorese in einem 
0,8%igen Agarosegel wurde der linearisierte Vektor (ca. 5,0 kb) mit dem GFX™PCR, DNA and 
Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers iso- 
liert Dieses Vektor-Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, 
Mannheim) nach Angaben des Herstellers mit dem synthetischen DoppelsWngigen DNA- 
Fragment ligiert und der Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Mole- 
cular Cloning. A Laboratory Manual, Cold Spring Harbor, beschrieben(1989)), in kompetente 
E.coli XL-1 Blue (Stratagene, La Jolla, USA) transformiert Eine Selektion auf Plasmid tragende 
Zelien wurde durch das Ausplattieren auf Kanamycin (20pg/ml) haWgen LB Agar (Lennox, 1 955, 
Virology, 1:190) errcichL 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus OberprOft. Das so 
erhaltene Plasmid erhait den Namen pCLiKSMCS. 

Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Proceedings of the National Aca- 
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demy of Sciences USA 74:5463-5467 durchgefOhrt. Die Sequenzierreaktionen wurden mittels 
ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewertet 

Das entstandene Plasmid pCLiK5MCS ist als SEQ ID NO: 63 aufgefOhrt 

5 

Beispiel 2: Konstruktion von pCLIKSMCS integrate sacB 

Ausgehend vom Plasmid pK1 9mob (Schafer et al. f Gene 1 45,69-73(1 994)) als Template fGr eine 
PCR Reaktion wurde mitden Oligonukleotiden BK1732 und BK1733 das Bacillus subtilis sacB 
1 0 Gen (kodierend fGr Levan Sucrase) ampliftziert 

BK1732(SEQIDNO:61): 

5 4 -GAGAGCGGCCGCCGATCCTTTTTAACCCATCAC-3' 

15 BK1733(SEQIDNO:62): 

S'-AGGAGCGGCCGCCATCGGCATTTTCTTTTGCG-y 

Neben den zu pEK19mobsac komplementaren Sequenzen, enthalten die Qligonukleotide 
BK1732 und BK1733 Schnittstellen fQr die Restriktionsendonuklease Notl. Die PCR Reaktion 

20 wurde nach Standardmethode wie Innis et al. (PCR Protocols. A Guide to Methods and Applica- 
tions, Academic Press (1990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) durchge- 
fOhrt. Das erhaltene DNA Fragment mit einer Gr6Be von ungefahr 1,9 kb wurde mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Das DNA-Fragment wurde mit der Restriktionsendonuklease Notl 

25 (New England Biolabs, Beverly, USA) geschnitten und im AnschluB daran emeut mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt. 

Der Vektor pCLiKSMCS (hergestellt gemas Beispiel 1 ) wurde ebenfalls mit der Restriktionsen- 
30 donuklease Notl geschnitten und mit alkalischer Phosphatase (I (Roche Diagnostics, Mann- 
heim)) nach Angaben des Herstellers dephosphoryliert Nach Elektrophoresein einem0,8%igen 
Agarosegel wurde ein ungefahr 2,4 kb grades Vektorfragment mit dem GFX™PCR, DNA and 
Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers iso- 



WO 03/087386 PCT/EP03/04010 

36 

liert. Dieses Vektor-Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, 
Mannheim) nach Angaben des Herstellers mit dem geschnittenen PCR Fragment ligiert und der 
Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laborato- 
ry Manual, Cold Spring Harbor, beschrieben(1989)) f in kompetente E.coliXL-1Blue (Stratagene, 
5 La Jolla, USA) transformiert. Eine Selektion auf Plasmid tragende Zellen wurde durch das Aus- 
plattieren auf Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreicht 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus Oberprttft Das so 
1 0 erhaltene Plasmid erhait den Namen pCLiKSMCS integrate sacB. 

Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Proceedings of the National Aca- 
demy of Sciences USA 74:5463-5467 durchgefOhrt Die Sequenzierreaktionen wurden mittels 
ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewertet 

15 

Das entstandene Plasmid pCLiKSMCS integrate sacB ist als SEQ ID NO: 64 aufgefOhrt. 

Weitere Vektoren die zurerfindungsgemSften Expression Oder Oberproduktion von metH-Genen 
geeignet sind, kflnnen in analoger Weise herstellt werden. 

20 

In den folgenden Beispielen 3 bis 8 wird die schrittweise Konstiuktion eines verbesserten Me- 
thionin-produzierenden Stammes mit der Bezeichnung LU1479 lysC 311 ile ET-16 pC Phsdh 
metH_Sc beschrieben. 

25 Beispiel 3: Isolierung des lysC gens aus dem C. glutamicum Stamm LU1479 

Im ersten Schritt der Stammkonstruktion soli ein allelischer Austausch des lysC Wildtypgens, 
kodierend fur das Enzym Aspartatkinase, in C. glutamicum ATCC1 3032, im folgenden LU1479 
genannt, durchgefOhrt werden. Dabei soli im LysC Gen ein Nukleotidaustausch durchgefOhrt 
30 werden, so dass im resultierenden Protein die AminosSure Thr an der Position 31 1 durch die 
AminosSure Ile ausgetauscht isL 



Ausgehend von der chromosomalen DNA aus LU1479 als Template fur eine PCR Reaktion 
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wurde mit den Oligonukleotidprimem SEQ ID NO:65 und SEQ ID NO:66 lysC mit Hilfe des Pfu- 
Turbo PCR Systems (Stratagene USA) nach Angaben des Herstellers amplifiziert Chromo- 
somale DNA aus C. glutamicum ATCC 13032 wurde nach Tauch etal. (1995) Plasmid 33:168- 
179 oder Eikmanns etal. (1994) Microbiology 140:1817-1828 prflpariert Das ampfifizierte Frag- 
5 ment wird an seinem S'-Ende von einem Sail Restriktionsschnitt und an seinem 3'-Ende von 
einem Mlul Restriktionsschnitt flankiert Vor der Klonierung wurde das amplifizierte Fragment 
durch diese beiden Restriktionsenzyme verdaut und mit GFX™PCR, DNA and Gel Band Purifi- 
cation Kit (Amersham Pharmacia, Freiburg) aufgereinigt. 

10 SEQIDN0:65 

S'-GAGAGAGAGACGCGTCCCAGTGGCTGAGACGCATC -3' 

~ SEQID NO:66 

5 t -CTCTCTCTGTCGACGAATTCAATCTTACGGCCTG-3 , 

15 

Das erhaltenen Polynukleotid wurde Qber die Sail und Mlul Restriktionsschnitte in pCLIKS MCS 
integrativ SacB (im folgenden pCIS genannt; SEQ ID NO: 64 aus Beispiel 2) kloniert und in 
E.coli XL-1 blue transformiert Eine Selektion auf Plasmid-tragende Zellen wurde durch das 
Ausplattieren auf Kanamycin (20pg/ml)-ha1tigen LB Agar (Lennox, 1955, Virology, 1:190) er- 

20 reicht. Das Plasmid wurden isoliert und durch Sequenaerung die erwartete Nukleotidsequenz 
bestatigt. Die Preparation der Plasmid-DNA wurde nach Methoden und mit Materialien der Rrma 
Quiagen durchgefOhrt. Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Procee- 
dings of the National Academy of Sciences USA 74:5463-5467 durchgefOhrt. Die Sequenzier- 
reaktionen wurden mittels ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und 

25 ausgewertet. Das erhaltene Plasmid pCIS lysC ist als SEQ ID NO:77 aufgefOhrt 

Die Sequenz SEQ ID NO:77 umfasst die folgenden wesentlichen Teilbereiche: 



Position 


Art der Se- 
quenz 


Beschreibung 


155-1420 


CDS 1 ' 


lysC 


1974-2765 


CDS 


Kanamycin-Resistenz 
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3032-3892 


CDS 


Replikatlonsursprung/ 


(complement) 2 * 




E.coli/Plasmid pMB 



Kodierende Sequenz 
auf KomplementSrstrang 



Belsplel 4: Mutagenese des lysC Gens aus C. glutamicum 

5 Die gerichtete Mutagenese des lysC Gens aus C. glutamicum (Beispiel 3) wurde mit dem 
QuickChange Kit (Fa. Stratagene/USA) nach Angaben des Herstellers durchgefGhrt Die Muta- 
genese wurde im Plasmid pCIS lysC. SEQ ID NO:77 durchgeffihrt. FOr den Austausch von 
thr311 nach 311ile mit HHfe der Quickchange Methode (Stratagene) wurxlen folgende Oligo- 
nukleotidprimer synthetisiert 

10 

SEQIDNO:67 

S^CGGCACCACCGACATCATCTTCACCTGCCCTCGTTCCG -3' 
SEQ ID NO:68 

1 5 5*-CGGAACGAGGGCAGGTGAAGATGATGTCGGTGGTGCCG -3' 

Der Einsatz dieser Oligonukleotidprimer in der Quickchange Reaktion fOhrt in dem lysC Gen zu 
einem Austausch des Nukleotids in Position 932 (von C nach T) (vgl. SEQ ID NO:75) und im 
korrespondierenden Enzym zu einem AminosSuresubstitution in Position 311 (Thr->lle) (vgl. 
20 SEQ ID NO:76). Der resultierende AminosSureaustausch Thr31 1 lie im lysC Gen wurde nach 
Transformation in E.coli XL1-blue und Plasmidpraparation durch Sequenzierung bestatigt. Das 
Plasmid erhielt die Bezeichnung pCIS lysC thr311ile und ist als SEQ ID NO:78 aufgefOhrt. 

Die Sequenz SEQ ID NO:78 umfasst die folgenden wesentiichen Teilbereiche: 

25 



Position 


Art der Se- 
quenz 


Beschreibung 


155-1420 


CDS 1 ' 


lysC mutiert 


1974-2765 


CDS 


Kanamycin-Resistenz 



I 
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3032-3892 


CDS 


Replikationsursprung/ 


(complement) 2 * 




E.coli/Plasmid pMB 



Kodierende Sequenz 
auf KomplementSrstrang 



Das Plasmid pCIS lysC thr31 1 ile wurde in C. glutamicum LU1479 mittels Elektroporation wie bei 
5 Liebl, et al. (1 989) FEMS Microbiology Letters 53:299-303 beschrieben, transformiert. Modifikatt- 
onen des Protokolls sind in DE-A-1 0046870 beschrieben. Die chromosomale Anordnung des 
lysC-Lokus einzelner Transformanten wurde mit Standardmethoden durch Southemblot und 
Hybridisierong, wie in Sambrook et al. (1989), Molecular Cloning. A Laboratory Manual, Cold 
Spring Harbor, beschrieben, Oberprtlft Dadurch wurde sichergestellt, dass es sich bei den 
1 0 Transformanten urn solche handelt, die das transformierte Plasmid durch homologe Rekombina- 
tion am lysC-Lokus integriert haben. Nach Wachstum solcher Kolonien Ober Nacht in Medlen, 
die kein Antibiotikum enthielten, wurden die Zellen auf ein Saccharose-CM-Agarmedlum (10% 
Saccharose) ausplattiert und bei 30°C fQr 24 Stunden inkubiert 

Da das im Vektor pCIS lysC thr31 lile enthaltende sacB Gen Saccharose in ein toxisches Pro- 
1 5 dukt umwandelt, kdnnen nur solche Kolonien anwachsen, die das sacB Gen durch elnen zwei- 
ten homologen Rekombinationsschrittzwischen dem Wildtyp lysC Gen und dem mufierten Gen 
lysC thr31 Uledeletiert haben. WShrend der homologen Rekombination kann entwederdas Wild- 
typ Gen oder das mutierte Gen zusammen mit dem sacB Gen deletiert werden. Wenn das sacB 
Gen zusammen mit dem Wildtyp Gen entfemt wird. resultiert eine mutierte Transformante. 

20 

Anwachsende Kolonien wurden gepickt, und auf eine Kanamycin-sensitiven Phflnotyp hln unter- 
sucht. Klone mit deletiertem SacB Gen mOssen gleichzeitg Kanamycin-sensitives Wachstums- 
verhalten zeigen. Solche Kan-sensitiven Klone wurde im einem Schuttelkolben auf ihre Lysin- 
Produktivitat hin untersucht (siehe Beispiel 6). Zum Vergleich wurde der nichtbehandelte Stamm 
25 LU 1 479 angezogen. Klone mit einer gegenOber der Kontrolle erh6hten Lysin-Produktion wurden 
selektiert, chromosomale DNA wurde gewonnen und derentsprechende Bereich des lysC Gens 
wurde durch eine PCR-Reaktion amplifiziert und sequenziert. Ein solcher Klon mit der Eigen- 
schaft erhfthter Lysin-Synthese und nachgewiesener Mutation in lysC an der Stelle 932 wurde 
mit LU1479 lysC 31 1ile bezeichnet). 
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Beispfel 5: Herstellung Ethionin-resistenter C. glutamicum StSmme 

Imzweiten SchrittderStammkonstruktion wurde dererhaltene Stamm LU1479lysC311ile(Bei- 
5 spiel 4) behandelt, urn eine Ethionin-Resistenz (Kase, H. Nakayama K.Agr. Biol. Chem. 39 1 53- 
106 1975 L-methionine production by methionine analog-resistant mutants of Corynebacterium 
glutamicum) zu induzieren: Eine Obemachtkultur in BHI-Medium (Difco) wurde in Citratpuffer 
(50mM pH 5,5) gewaschen und bei 30°C fOr 20 min mit N-Methyl-nltrosoguanidin (10mg/ml in 
50mM Citrat pH5,5) behandelt. Nach der Behandlung mit dem chemischen Mutagen N-Methyl- 

1 0 nitrosoguanidin wurden die Zellen gewaschen (Citratpuffer 50mM pH 5,5) und auf ein Medium 
plattiert, das aus folgenden Komponenten, berechnet auf 500ml, zusammengesetzt war. 10g 
(NH<)2SO 4 .0.5g KH 2 PO 4 .0.5g K 2 HP0 4 , 0.125g MgS0 4 -7H 2 0, 21g MOPS, 50mg CaCI 2 , 15mg 
Proteokatechuat, 0,5mg Biotin, 1 mg Thiamin, 5g/l D,L-Ethionln (Sigma Chemicals Deutschland), 
pH 7,0. AuBerdem enthielt das Medium 0.5ml einer SpurensalzlOsung aus: 10g/l FeS0 4 .7H 2 0, 

15 1 g/l MnS0 4 *H 2 0, 0.1 g/l ZnSO^HA 0.02g/l CuS0 4 . 0.002g/l NiCI ? *6H 2 0, Alle Salze wurden in 
0,1 M HCI gelGst Das fertig zusammengestellte Medium wurde sterilfiltriert und nach Zugabe von 
40ml steriler 50% Glucoselflsung, mit flOssigem sterilem Agar in einer Endkonzentration von 
1 ,5% Agar versetzt und in Kulturschalen ausgegossen. 

20 Auf Platten mit dem beschriebenen Medium wurden mutagenisierte Zellen aufgebracht und 3-7 
Tage bei 30°C inkubiert. Erhaltene Klone wurden isoliert, mindestens einmal auf dem Seiekti- 
onsmedium vereinzelt und dann auf ihre Methionin-ProduktivitSt in einem SchOttelkolben in Me- 
dium II untersucht (siehe Beispiel 6 

25 Beispiel 6: Herstellung von Methionin mit dem Stamm LU1479 lysC 31 1ile ET-16. 

Die in Beispiel 5 hergestellten Stamme wurden auf einer Agar-Platte mit CM-Medium f0r2 Tag 
bei 30°C angezogen. 

CM-Agan 

30 1 0,0 g/l D-Glucose, 2,5 g/l NaCI, 2,0 g/l Harnstoff, 1 0,0 g/l Bacto Pepton (Difco), 5,0 
g/l Yeast Extract (Difco), 5,0 g/l Beef Extract (Difco), 22,0 g/l Agar (Difco), autoklaviert (20 min., 
121°C) 
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Anschliefiend wurden die Zellen von der Platte abgekratzt und in Saline resuspendiert FOr die 
Hauptkulturwurden 10 mi Medium II und 0,5 g autoklaviertes CaC0 3 (Riedel de Haen) in einem 
1 00 ml Erlenmeyerkolben mit der Zellsuspension bis zu einer OD600nm von 1 ,5 beimpft und fQr 
5 72h auf einem Orbitalschuttler mit 200 Upm bei 30°C inkubiert 

Medium II: 



40g/l Saccharose 

60g/l Melasse (auf 1 00% Zuckergehalt berechnet) 

10 10g/l (NH 4 )2S0 4 

0.4g/l MgS0 4 *7H 2 0 

0.6g/l KH 2 P0 4 
0.3mg/l~ Thiamin*HCI 

1 mg/l Biotin (aus einer 1 mg/ml steril filtrierten Stammtdsung die mit NH 4 OH auf pH 

1 5 8,0 eingestellt wurde) 

2mg/l FeS0 4 

2mg/l MnS0 4 



mit NH 4 OH auf pH 7,8 eingestellt, autoklaviert (121°C, 20 min). Zusatztich wird Vitamin B12 
(Hydroxycobalamin Sigma Chemicals) aus einer StammlSsung (200 pg/ml, steril filtriert) bis zu 
20 einer Endkonzentration von 100 jjg/l zugegeben 

Gebildetes Methionin, sowie andere AminosSuren in der KuIturbrOhe wurde mit Hilfe der Ami- 
nos§uresaure-Bestimmungsmethode von Agilent auf einer Agilent 1100 Series LC System 
HPLC. Eine Derivatisierung vor der SSulentrennung mit Ortho-Phthalaldehyd eriaubte die Quan- 
25 tifizierung der gebildeten AminosSuren. Die Auftrennung des Aminosauregemisch fand auf einer 
Hypersil AA-Saule (Agilent) statt 

Solche Klone wurden isoliert, deren Methionin-Produktivitat mindestens doppelt so hoch war, wie 
die des Ausgangsstamm LU1479 lysC 311ile. Ein solcherKlon wurde fQr die weiteren Versuche 
30 eingesetzt und bekam die Bezeichnung LU1479 lysC 31 1 ile ET-16. 



Belsplel 7: Klonierung von metH aus Streptomyces coelicolorund Klonierung in das Plasmid pC 
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PhsdhmetH_Sc 

a) Chromosomale DNA wurde aus Streptomyces coelicolor Siamm ATCC BAA-471 (von 
der American Type Strain Culture Collection, (ATCC) Atlanta, USA, unter der Bestellnummer 

5 BAA-471 D erhSltlich) isoliert. Chromosomale DNA aus C. glutamicum ATCC 1 3032 wurde nach 
Tauch et al. (1995) Plasmid 33:168-179 oder Eikmanns et al. (1994) Microbiology 140:1817- 
1828 prapariert. 

Mitden Oligonukleotidprimer SEQ ID NO:69 und SEQ ID NO:70, der chromosomalen DNA aus 
10 C. glutamicum als Template und Pfu Turbo Polymerase (Fa. Stratagene) wurde mit Hilfe der 
Polymerase-Kettenreaktion (PCR) nach Standardmethoden, wie Innis et al. (1 990) PCR Proto- 
cols. A Guide to Methods and Applications, Academic Press, ein DNA Fragment von ca. 180 
Basenpaaren aus dem nichtkodierenden 5-Bereich (Promotorregion)der Homoserindehydro- 
genase (HsDH) amplifiziert Das amplifizierte Fragment ist an seinem S^Ende von einer Xhol- 
15 Restriktionsschnittstelte und am 3-Ende von einem Ober das Oligo eingefOhrten zu metH aus 
Streptomyces coelicolor homologen Bereich flankiert. 

SEQIDNO:69 

' S'-GAGACTCGAGGGAAGGTGAATCGAATTTCGG-S' 
20 und 

SEQIDNO:70 

S'-GTCCCGGGGAGAACGCACGATTCTCCAAAAATAATCGC-S 1 

Das erhaltene DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band Purification Kit 
25 (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt. 

b) Ausgehend von der chromosomalen DNA aus Streptomyces coelicolor als Template fOr 
eine PCR Reaktion wurde mit den Oligonukleotidprimern SEQ ID NO:71 und SEQ ID NO:72 ein 
Teil von metH mit Hilfe des GC-RICH PCR Systems (Roche Diagnostics, Mannheim) nach An- 

30 gaben des Herstellers amplifiziert. Das amplifizierte Fragment ist an seinem 5-Ende von einem 
Ober das Oligo eingefOhrten, zur Promotorregion von HsDH aus C. glutamicum homologen 
Bereich flankiert. 
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SEQ ID NO:71 

S'-GAATCGTGCGTTCTCCCCGGGAC -3' 
und 

SEQ ID NO:72 

S'-GTAGTTGACCGAGTTGATCACC -3' 

Das ca. 1 ,4 kb grolie erhaltene DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band 
Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gerelnigt 

c) In einer weiteren PGR Reaktion wurden die beiden oben erhaltenen Fragmente gemein- 
sam als Template eingesetzt. Durch die mit dem Oligonukleotidprimem SEQ ID NO:71 und SEQ 
ID NO:70 eingebrachten, zu dem jeweils anderen Fragment homologen Bereichen, kommt es im 
Zuge der PCR-Reaktion zu einer Anlagerung beider Fragmente aneinander und einer Ver- 
ISngerung zu einem durchgehenden DNA-Strang durch die eingesetzte Polymerase. Die Stan- 
dardmethode wurde dahingehend modifiziert, dass die verwendeten OligonukleotidprimerSEQ 
ID NO:69 und SEQ ID NO:72 erst mit Beginn des 2. Zyklus dem Reaktionsansatz zugegeben 
wurden. 

Das amplifizierte DNA Fragment von ungefShr 1 f 6 kb wurde mit dem GFX™PCR, DNA and Gel 
Band Purification Kit nach Angaben des Herstellers gereinigt. Im Anschluss daran wurde es mit 
den Restriktionsenzymen Xhol und Notl (Roche Diagnostics, Mannheim) gespalten und gelelek- 
trophoretisch aufgetrennt AnschlieBend wurde das ca. 1,6 kb grolie DNA Fragment mit 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) aus der Aga- 
rose aufgereinigL 

d) Der noch fehlende 3 -Bereich von metH wurde ausgehend von der chromosomalen DNA 
aus Streptomyces coelicolor als Template mit den Oligonukleotidprimem SEQ ID NO:73 und 
SEQ ID NO:74 mit Hilfe des GC-RICH PCR Systems (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers amplifiziert. Das amplifizierte Fragment ist an seinem 3-Ende von einer 
Ober das OHgo eingefQhrten EcoRV-Restriktionsschnittstelle flankiert 

SEQ ID NO:73 

5 , -CCGGCCTGGAGAAGCTCG-3 , 
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and ' 
SEQ ID NO:74 

5'-GAGAGATATCCCTCAGCGGGCGTTGAAG-3' 

5 Das erhaltene ca. 2,2 kb groBe DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band 
Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt Im 
AnschluB daran wurde es mit den Restriktionsenzymen Notl und EcoRV (Roche Diagnostics, 
Mannheim) gespalten und gelelektrophorefisch aufgetrennt Anschlieflend wurde das ca. 2,2 kb 
grolie DNA Fragment mit GFX™PCR, DNA and Gel Band Purification Kit (Amersham Phar- 
1 0 macia, Freiburg) aus der Agarose aufgereinigt 

e) Der Vektor pClik5MCS SEQ ID NO:63 (Beispiel 1 ) wurde mit den Restriktionsenzymen 
Xhol und EcoRV (Roche Diagnostics, Mannheim) geschnitten und ein 5 kb groBes Fragment 
nach elektrophoretischer Auftrennung mit GFX^PCR, DNA and Gel Band Purification Kit isoliert 

15 

Das Vektorfragment wurde zusammen mit den beiden geschnittenen und aufgereinigten PGR- 
Fragmenten mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers ligiert und der Ligationsansatz nach Standardmethoden wie in Sambrook 
et al. (Molecular Cloning. A Laboratory Manual, Cold Spring Harbor. beschrieben(1989)), in 
20 kompetente E.coli XL-1 Blue (Stratagene. La Jolla, USA) transformiert Eine Selektion auf Plas- 
mid-tragende Zellen wurde durch das Ausplattieren auf Kanamycin (20pg/ml) haltigen LB Agar 
(Lennox, 1 955, Virology, 1 : 1 90) erreicht 

Die Praparation der Plasmid DNA wurde nach Methoden und mit Materialien der Fa. Quiagen 
25 durchgefOhrt. Sequenzierungsreaktionen wurden nach Sanger et al. (1 977) Proceedings of the 
National Academy of Sciences USA 74:5463-5467 durchgefOhrt Die SequenzierTeaktionen wur- 
den mittels ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewertet 

Das entstandene Plasmid pC Phsdh metH_Sc {Streptomyces coelicotor) ist als SEQ ID NO:79 
30 aufgefOhrt. 

Die Sequenz SEQ ID NO:79 umfasst die folgenden wesentlichen Teilbereiche: 
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Position 


Art der Se- 
quenz 


Beschreibung 


6-155 


Promoter 


HsDH 


156-3752 


CDS 1 ' 


MetH S. coelicolor 


4153-4944 


CDS 


Kanamycin-Resistenz 


5211 -6071 
(complement) 2 * 


CDS 


Replikationsursprung/ 
E.coli/Plasmid pMB 



1) Kodierende Sequenz 
21 auf Komplementarstrang 



Beispiel 8: Transformation des Stammes LU1479 lysC 31 1ile ET-16 mitdem Plasmid pC Phsdh 
metH_Sc 

Der Stamm LU1479 lysC 31 1 ile ET-16 (Beispiel 5) wurde mit dem Plasmid pC Phsdh metH_Sc 
1 0 (Beispiel 7) nach der beschriebenen Methode (Liebl, et al. (1 989) FEMS Microbiology Letters 
53:299-303) transformiert Die Transformationsmischung wurde auf CM-Platten plattiert, die 
zusatzlich 20mg/l Kanamycin enthielten, urn eine Selektion auf Plasmid-haltige Zellen zu enei- 
chen. Erhaltene Kan-resistente Klone wurden gepickt und vereinzelt Die Methionin-Produktivitat 
der Klone wurde in einem SchOttelkolbenversuch (s. Beispiel 6) untersucht Der Stamm LU1479 
15 lysC 31 1ile ET-16 pC Phsdh metH_Sc produzierte im Vergleich zu LU1479 lysC 311 ile ET-16 
signifikant mehr Methionin. 
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PatentansprOche 

Verfahren zur fermentativen Herstellung wenigstens einer schwefelhalligen 
Feinchemikalie, welches folgende Schritte umfasst: 

a) Fermentation einer die gewOnschte schwefelhaltige Feinchemikalie 
produzierenden coryneformen Bakterienkultur, wobei in den coryneformen 
Bakterien zumindest eine heterologe Nukleotidsequenz exprimiert wird, 
welche fOr ein Protein mit Methionin-Synthase (metF) -Aktivitat kodiert; 

b) Anreicherung der schwefelhaltigen Feinchemikalie im Medium Oder in den 
Zellen der Bakterien, und 

c) Isolieren der schwefelhaltigen Feinchemikalie. 

2. Verfahren nach Anspruch 1, wobei die schwefelhaltige Feinchemikalie L-Methlonin 
15 umfasst 

3. Verfahren nach einem der vorhergehenden AnsprOche, wobei sich die heterologe metF- 
kodierende Nukleotidsequenz zur metF-kodierenden Sequenz aus Corynebacterium 
glutamicum ATCC 13032 eine Sequenzhomologie vom weniger als 100% aufweist 

20 

4. Verfahren nach Anspruch 3, wobei die metF-kodierende Sequenz aus einem der 
folgenden Organismen abgeleitet ist 



Organimsus 


Stammsammlung 


Corynebacterium diphteriae 


ATCC 14779 


Streptomyces lividans 


ATCC 19844 


Streptomyces coelicolor 


ATCC 10147 


Aquifex aeolicus 


DSM 6858 


Burkholderia cepacia 


ATCC 25416 


Nitrosomonas europaea 


ATCC 19718 


Pseudomonas aeruginosa 


ATCC 17933 


Xylella fastidiosa 


ATCC 35881 


Pseodomonas fluorescens 


ATCC 13525 


Schizosaccharomyces pombe 


ATCC 24969 


Saccharomyces cerevisiae 


ATCC 10751 


Erwinia carotovora 


ATCC 15713 


Klebsiella pneumoniae 


ATCC 700721 


Salmonella typhi 


ATCC 12839 


Salmonella typhimurium 


ATCC 15277 


Escherichia coli K12 


ATCC55151 



NAE 289/2002 58/Dp 27.08.2002 
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Vibno cholerae 


ATCC 39315 


Haemophilus influenzae 


ATCC 51907 


Caulobacter crescentus 


ATCC 19089 


Actinobacillus 
actinomycetemcomitans 


ATCC 33384 


Neisseria meningitis 


ATCC 6253 


Rhodobacter capsulatus 


ATCC 11166 


Campylobacter jejuni 


ATCC 33560 


Lactococcus lactis 


ATCC 7962 


Prochlorococcus marinus 


PCC7118 


Bacillus stearothennophilus 


ATCC 12980 



5. Verfahren nach einem der vorhergehenden AnsprOche, wobei die metF-kodierende 
Sequenz eine kodierende Sequenz gemSli SEQ ID NO:1, 3 f 5, 7 ? 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29. 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51 und 53 oder eine dazu 
homologe Nukleotidsequenz, welche fur ein Protein mit metF-Aktivitat kodiert, umfasst 

6. Verfahren nach einem der vorhergehenden AnsprOche, wobei die metF-kodierende 
Sequenz fQr ein Protein mit metF-Aktivitat kodiert, wobei das Protein eine 
Aminosauresequenzgemaa SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22,24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52 und 54 oder eine dazu homologe 
AminosSuresequenz, welche fOr ein Protein mit metF-Aktivitat steht, umfasst 

7. Verfahren nach einem der vorhergehenden AnsprOche, wobei die kodierende metF- 
Sequenz eine in coryneformen Bakterien replizierbare oder eine stabil in das 
Chromosom intregrierte DNA oder eine RNA ist 

8. Verfahren gernad Anspruch 7, wobei man 

a) einen mit einem Plasmidvektor transformierten Bakterienstamm einsetzt der 
wenigstens eine Kopie der kodierenden metF-Sequenz unter der Kontrolle 
regulativer Sequenzen trSgt, oder 

b) einen Stamm einsetzt, in dem die kodierende metF-Sequenz in das Chromosom 
des Bakteriums integriert wurde 



9. 



Verfahren nach einem der vorhergehenden AnsprOche, wobei die kodierende metF- 
Sequenz Oberexprimiert wird. 
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10. Verfahren gemaB einem der vorhergehenden Ahspruche, wobei man Bakterien 
fermentiert, in denen zusdtzlich wenigstens ein weiteres Gen des Biosyntheseweges der 
gewQnschten schwefethaltigen Feinchemikalie verstdrkt ist Oder derart mutiert ist, dass 

5 es durch Stoffwechselmetabolite nicht in seiner Aktivitat beeinflusst wird. 

■ ■ i< 

11. Verfahren gemaii einem der vorhergehenden AnsprQche, wobei man Bakterien 
fermentiert, in denen wenigstens ein Stoffwechselweg zumindest teilweise ausgeschaltet 
sind, der die Bildung der gewQnschten schwefelhaltigen Feinchemikalie verringert 

10 

12. Verfahren gemafi einem der vorhergehenden AnsprQche, wobei man coryneforme 
Bakterien fermentiert in denen gleichzeitig wenigstens eines der Gene, ausgewdhlt unter 

dem fOr eine Aspartatkinase kodierenden Gen lysC, 

dem fOr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierenden Gen gap, 
dem fQr die 3-Phosphoglycerat Kinase kodierenden Gen pgk, 
dem fur die Pyruvat Carboxylase kodierenden Geri pyc, 
dem fQr die Triosephosphat Isomerase kodierenden Gen tpi, 
dem fOr die Homoserin O-Acetyltransferase kodierenden Gen metA, 
dem fQr die Cystahionin-gamma-Synthase kodierenden Gen metB, 
dem fQr die Cystahionin-gamma-Lyase kodierenden Gen metC, 
dem fQr die Serin-Hydroxymethyltransferase kodierenden Gen glyA, 
dem fQr die O-Acetylhomoserin-Sulfhydrylase kodierenden Gen metY, 
dem fur das metH Gen, das fOr die Vitamin B12 abhangige Methionin-Synthase 
kodiert, 

dem fur das serC Gen, das fQr die Phosphoserin-Aminotransferase kodiert, 
dem serB Gen, das fQr die Phosphoserin-Phosphatase kodiert, 
dem cysE Gen, das fur die Serine Acetyl-Transferase kodiert, und 
dem horn Gen, das eine Homoserin-Dehydrogenase kodiert, 



15 



20 



25 



a) 

b) 

c) 

d) 

e) 

0 

9) 

h) 

i) 

j) 

k) 

I) 

m) 
n) 
o) 



Qberexprimiert oder so mutiert ist, dass die korrespondierenden Proteine, verglichen mit 
nicht mutierten Proteinen, in geringerem Made oder nicht durch Stoffwechselmetabolite 
in ihrer Aktivitat beeinflusst werden. 
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13. Verfahren gemSli einem der vortiergehenden AnsprOche, wobei man coryneformen 
Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShlt unter 

a) dem fOr die Homoserine-Kinase kodierenden Gen thrB, 

b) dem fOr die Threonin Dehydratase kodierenden Gen ilvA, 

c) dem fQr die Threonin Synthase kodierenden Gen thrC 

d) dem fQr die Meso-Diaminopimelat D-Dehydrogenase kodierenden Gen ddh 

e) dem fQr die Phosphoenolpyruvat-Carboxykinase kodierenden Gen pck, 
0 dem fQr die Glucose-6-Phosphat-6-lsomerase kodierenden Gen pgi, 

g) dem fur die Pyruvat-Oxidase kodierenden Gen poxB, 

h) dem fQr die Dihydrodipicolinat Synthase kodiemden Gen dapA, 

i) dem fQr die Dihydrodipicolinat Reduktase kodiemden Gen dapB; oder 
j) dem fQr die Diaminopicolinat Decarboxylase kodiemden Gen 

durch VerSnderung der Expressionsrate Oder durch EinfQhrung einer gezielten Mutation 
abschwScht ist 

14. Verfahren gemSB einem oder mehreren der vortiergehenden AnsprOche, wobei man 
Mikroorganismen der Art Corynebacterium glutamicum einsetzt. 

15. Verfahren zur Herstellung eines L-Methionin haltigen Tierfuttermittel-Additivs aus 
Femnentationsbruhen, welches folgende Schritte umfasst 

a) Kultivierung und Fermentation eines L-Methionin produzierenden 
Mikroorganismus in einem Fermentationsmedium; 

b) Entfemung von Wasser aus der L-Methionin haltigen FermentationsbrOhe; 

c) Entfemung der wShrend der Fermentation gebildeten Biomasse in einer Menge 
von 0 bis 100 Gew.-%; und 

d) Trocknung der gemali b) und/oder c) erhaltenen FermentationsbrOhe, urn das 
Tierfuttermittel-Additiv in der gewunschten Pulver- Oder Granulatform zu ertialten. 

16. Verfahren gemali Anspmch 15, wobei man Mikroorganismen gemali der Definition in 
einem der AnsprOche 1 bis 14 einsetzt. 
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SEQUENZPROTOKOLL 

<110> BASF Aktiengesellschaft 

<120> MetH 

<130> M/43120 

<140> 

<141> 

<160> 79 

<210> 1 
<211> 3597 
<212> DNA 

<213> Streptomyces coelicolor 

<220> 
<221> CDS 
<222> <1)..(3594) 
<223> RSX14254 

<400> 1 

gtg cgt tct ccc egg gac gtc cca cga egg gcg gca ccg ggc aga ggc 48 

Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly 
1 5 10 .15 

aaa gec gac age cgt cgc ate eta ggg age cct ttc atg gec teg teg 96 
Lye Ala Asp Ser Arg Arg He Leu Gly Ser Pro Phe Met Ala Ser Ser 
20 25 30 

cca tec acc ccg ccc gec gac ace cgc acc cgc gtg tec gec etc cga 144 
Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg 
35 40 45 

gag gec etc gec acc cgc gtg gtg gtc gec gac ggc gec atg ggc acc 192 
Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr 
50 55 so 



atg etc cag gec cag aac ccc acg ctg gac gac ttc cag cag etc gaa 
Met Leu Gin Ala Gin Asn Pro Thr Leu Asp Asp Phe Gin Gin Leu Glu 
65 70 75 80 

ggg tgc aac gag gtc ctg aac etc acc egg ccc gac ate gtc cgc teg 
Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp He Val Arg Ser 
85 90 95 

gtg cac gag gag tac ttc gcg gec ggc gtc gac tgc gtc gag acc aac 
Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn 
100 105 no 

fu° 11° H? C 9 f C aaC Cac tcc 9CC ct 9 9 a 9 tac 9«c ate ccc gag 
Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp He Pro Glu 

115 120 125 

cgc gtc cac gaa ctg tcc gag gee ggc gec cgc gtc gee cgc gag gtc 
Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val 
"0 135 140 

gec gac gag ttc ggc gec cgc gac ggc egg cag cgc tgg gtg ctg ggc 



240 



288 



336 



384 



432 



480 
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Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gin Arg Trp Val Leu Gly 
145 150 155 160 

tec atg ggc ccc ggc acc aag etc ccc acc etc ggc cac gec ccg tac 528 
Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr 
165 170 175 

acc gtc ctg cgc gac gec tac cag cgc aac gec gag gga ctg gtc gcg 576 
Thr Val Leu Arg Asp Ala Tyr Gin Arg Asn Ala Glu Gly Leu Val Ala 
180 185 190 

ggc ggc gcg gac gca ctg ctg gtg gag acc acg cag gac ctg etc cag 624 
Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gin Asp Leu Leu Gin 
195 200 205 

acc aag gee teg gtg etc ggc gec egg cgc gee ctg gac gtc etc ggc 672 
Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly 
210 215 220 

etc gac ctg ccg etc ate gtg tec gtc acc gtc gag acc acc ggc acc 720 
Leu Asp Leu Pro Leu He Val Ser Val Thr Val Glu Thr Thr Gly Thr 
225 230 235 240 

atg ctg etc ggc teg gag ate ggc gec gcg etc acc gcg ctg gaa ccg 768 
Met Leu Leu Gly Ser Glu He Gly Ala Ala Leu Thr Ala Leu Glu Pro 
245 250 255 

etc ggc ate gac atg ate ggc ctg aac tgc gec acc ggc ccc gec gag 816 
Leu Gly He Asp Met He Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu 
260 265 270 

atg age gag cac ctg cgc tac etc gec egg cac tec cgc ate ccg ctg 864 
Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg He Pro Leu 
275 280 285 

acc tgc atg ccc aac gee ggt ctg ccc gtc etc ggc aag gac ggc gec 912 
Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala 
290 295 300 

cac tac ccg ctg acc gcg ccc gag ctg gee gac gca cac gag acc ttc 960 
His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe 
305 310 315 320 

gtg cgc gag tac ggc ctg tec ctg gtc ggc ggc tgc tgc ggc acc acg 1008 
Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr 
325 330 335 

ccc gag cac ctg cgc cag gtc gtc gag egg gtc egg gac acc gec ccc 1056 
Pro Glu His Leu Arg Gin Val Val Glu Arg Val Arg Asp Thr Ala Pro 
340 345 350 

acc gca cgc gac ccg cgc ccc gag ccc ggc gec gec teg etc tac cag 1104 
Thr Ala Arg Asp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gin 
355 360 365 

acc gtg ccc ttc cgc cag gac acc tec tac ctg gee ate ggc gag cgc 1152 
Thr Val Pro Phe Arg Gin Asp Thr Ser Tyr Leu Ala He Gly Glu Arg 
370 375 380 

acc aac gee aac ggg tec aag aag ttc cgc gag gee atg ctg gac ggc 1200 
Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly 
385 390 395 400 
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cgc tgg gac gac tgc gtc gag atg gcc cgc gac cag ate cgc gaa ggc 1248 
Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gin He Arg Glu Gly 
405 410 " 415 

gcg cac atg etc gac etc tgc gtc gac tac gtc ggc egg gac ggc gtc 1296 
Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val 
420 425 * 430 

gcc gac atg gag gaa ctg gcc ggc egg ttc gcc acc gcc tec acg ctg 1344 
Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Set Thr Leu 
435 440 445 

ccg ate gtc etc gac tec acc gag gtc gac gtc ate egg gcc ggc ctg 1392 
Pro He Val Leu Asp Ser Thr Glu Val Asp Val He Arg Ala Gly Leu 
450 455 460 

gag aag etc ggc ggc cgc gcg gtg ate aac teg gtc aac tac gag gac 1440 
Glu Lys Leu Gly Gly Arg Ala Val He Asn Ser Val Asn Tyr Glu Asp 
465 470 475 480 

ggc gcc ggc ccc gag tec egg ttc gcc cgc gtc acg aag etc gcc egg 1488 
Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg 
485 490 495 

gag cac ggc gcc gcg ctg ate gcg ctg acc ate gac gag gtg gga cag 1536 
Glu His Gly Ala Ala Leu He Ala Leu Thr He Asp Glu Val Gly Gin 
500 505 510 

gcc cgc acc gcc gag aag aag gtc gag ate gcc gaa egg etc ate gac 1584 
Ala Arg Thr Ala Glu Lys Lys Val Glu He Ala Glu Arg Leu He Asp 
515 520 525 

gac etc acc ggc aac tgg ggc ate cac gag tec gac ate etc gtc gac 1632 
Asp Leu Thr Gly Asn Trp Gly He His Glu Ser Asp He Leu Val Asp 
530 535 540 



tgc ctg acc ttc acc ate tgc acc ggc cag gag gag tec cgc aag gac 
Cys Leu Thr Phe Thr He Cys Thr Gly Gin Glu Glu Ser Arg Lys Asp 
545 550 555 560 



1680 



ggc ctg gcc acc ate gag ggc ate egg gaa etc aag egg cgc cac ccg 1728 
Gly Leu Ala Thr He Glu Gly He Arg Glu Leu Lys Arg Arg His Pro 
565 570 575 

gac gtg cag acc acg etc ggc ctg teg aac ate tec ttc ggc etc aac 1776 
Asp Val Gin Thr Thr Leu Gly Leu Ser Asn He Ser Phe Gly Leu Asn 
580 585 590 

ccg gcc gcc cgc ate ctg etc aac tec gtc ttc etc gac gaa tgc gtc 1824 
Pro Ala Ala Arg He Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val 
595 600 605 

aag gcc ggc ctg gac teg gcc ate gtg cac gcg age aag ate ctg ccg 1B72 
Lys Ala Gly Leu Asp Ser Ala He Val His Ala Ser Lys He Leu Pro 
610 615 620 

ate gcc cgc ttc gac gag gag cag gtc acc acc gcc etc gac ttg ate 1920 
He Ala Arg Phe Asp Glu Glu Gin Val Thr Thr Ala Leu Asp Leu He 
625 630 635 * 640 

tac gac cgc cgc cgc gag ggc tac gac ccc ctg caa aag etc atg cag 1968 
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Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gin Lys Leu Met Gin 
645 650 655 

etc ttc gag ggc gec acc gec aag teg ctg aag gec tec aag gee gag 2016 
Leu Phe Glu Gly Ala Thr Ala Lye Ser Leu Lys Ala Ser Lys Ala Glu 
660 665 670 

gaa ctg gec gec etc ccg ctg gag gag cgc etc aag cgc cgc ate ate 2064 
Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg He He 
675 680 685 

gac ggc gag aag aac ggc etc gaa cag gac etc gac gag gec etc egg 2112 
Asp Gly Glu Lys Asn Gly Leu Glu Gin Asp Leu Asp Glu Ala Leu Arg 
690 695 700 

gag cgc ccg gec etc gag ate gtc aac gac acc ctg etc gac ggt atg 2160 
Glu Arg Pro Ala Leu Glu He Val Asn Asp Thr Leu Leu Asp Gly Met 
705 710 715 720 

aag gtc gtc ggc gag ctg ttc ggc tec ggc cag atg cag ctg ccg ttc 2208 
Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
725 730 735 

gtg etc cag tec gee gag gtc atg aag acc gcg gtg gec cac ctg gag 2256 
Val Leu Gin Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu 
740 745 750 

ccg cac atg gag aag acc gac gac gac ggc aag ggc acg ate gtg ctg 2304 
Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr He Val Leu 
755 760 765 

gee acc gtc cgc ggc gac gtc cac gac ate ggc aag aac etc gtc gac 2352 
Ala Thr Val Arg Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp 
770 775 780 

ate ate ctg tec aac aac ggc tac aac gtc gtc aac etc ggc ate aag 2400 
He He Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly He Lys 
785 790 795 800 

cag ccc gtc tec gcg ate ctg gaa gcg gee gac gag cac egg gec gac 2448 
Gin Pro Val Ser Ala He Leu Glu Ala Ala Asp Glu His Arg Ala Asp 
BOS 810 815 

gtc ate ggc atg tec ggc etc etc gtc aag tec acg gtg ate atg aag 2496 
Val He Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val He Met Lys 
820 825 830 

gag aac ctg gag gag ctg aac cag cgc aag ctg gee gee gac tac ccg 2544 
Glu Asn Leu Glu Glu Leu Asn Gin Arg Lys Leu Ala Ala Asp Tyr Pro 
835 840 845 

gtc ate etc ggc ggc gee gee etc acc agg gee tac gtc gaa cag gac 2592 
Val He Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gin Asp 
850 855 860 

ctg cac gag ate tac gac ggc gag gtc cgc tac gee cgc gac gec ttc 2640 
Leu His Glu He Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe 
865 870 875 880 

gag ggc ctg cgc etc atg gac gec etc ate ggc ate aag cgc ggc gtg 2688 
Glu Gly Leu Arg Leu Met Asp Ala Leu He Gly He Lys Arg Gly Val 
885 890 895 
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ccc ggc gcc aag ctg ccg gag ctg aag cag cgc egg gtg egg gee gec 2736 
Pro Gly Ala Lys Leu Pro Glu Leu Lys Gin Arg Arg Val Arg Ala Ala 
900 905 910 , 

acc gtc gag ate gac gag cgc ccc gag gaa ggc cac gtc cgc tec gac 2784 
Thr Val Glu He Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp 
915 920 925 

gtc gee acc gac aac ccg gtc ccg acc ccg ccc ttc cgc ggc, acc cgc 2832 
Val Ala Thr Asp Abii Pro Val Pro Thr Pro Pro Phe Arg Gly" Thr Arg 
930 935 940 

gtc gtc aag ggc ate cag etc aag gag tac gcc tec tgg etc gac gag 2880 
Val Val Lys Gly He Gin Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu 
945 950 955 * 960 

ggc gcc etc ttc aag ggc cag tgg ggc etc aag cag gcc cgc acc ggc 2928 
Gly Ala Leu Phe Lys Gly Gin Trp Gly Leu Lys Gin Ala Arg Thr Gly 
965 970 975 

gag gga ccc tec tac gag gaa ctg gtc gag tec gag ggc egg ccg egg 2976 
Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg 
980 985 990 

ctg cgc ggc ctg etc gac egg etc cag acg gac aac ctt ttg gag gcg 3024 
Leu Arg Gly Leu Leu Asp Arg Leu Gin Thr Asp Asn Leu Leu 6lu Ala 
995 1000 1005 

gcc gtg gtc tac ggc tac ttc ccc tgc gtc tec aag gac gac gac ctg 3072 
Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu 
1010 1015 1020 

ate gtc etc gac gac gac ggc aac gaa cgc acc cgc ttc acc ttc ccc 3120 
He Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro 
1025 1030 1035 1040 

cgc cag cgc cgc ggc egg cgc ctg tgc ctg gcc gac ttc ttc cgc ccg 3168 
Arg Gin Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro 

1045 1050 1055 

gag gag tec ggc gag acc gac gtg gtc ggc ttc cag gtc gtc acc gtc 3216 
Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gin Val Val Thr Val 
1060 1065 1070 

ggc tec cgc ate ggc gag gag acg gcc cgc atg ttc gag gcc aac gcc 3264 
Gly Ser Arg He Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala 
1075 1080 1085 

tac cgc gac tat etc gag ctg cac ggc ctg tec gtg cag etc gcc gag 3312 
Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gin Leu Ala Glu 
1090 1095 1100 

gcc etc gcc gag tac tgg cac gcg cgc gtg cgc teg gaa etc ggc ttc 3360 
Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe 
H05 1110 1115 H20 

gcc ggg gag gac ccg gcc gag atg gag gac atg ttc gcc ctg aag tac 3408 
Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr 
1125 H30 1135 
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c 99 ggt gcc cgc ttc tec etc ggc tac ggc gec tgc ccc gac ctg gag 3456 
Arg Gly Ala Arg Phe Ser Leu Gly Tyr Gly Ala Cys Pro Asp Leu Glu 
1140 1145 - ' 1150 

gac cgc gcc aag ate gcc gcc ctg ctg gag ccc gag cgc ate ggc gtc 3504 
Asp Arg Ala Lys He Ala Ala Leu Leu Glu Pro Glu Arg lie Gly Val 
1155 1160 H65 

cac eta tec gag gag ttc cag etc cac ccc gag cag tec acc gac gcc 3552 
His Leu Ser Glu Glu Phe Gin Leu His Pro Glu Gin Ser Thr Asp Ala 
1170 1175 H80 

ate gtc ate cac cac ccg gag gcc aag tac ttc aac gcc cgc 3594 

He Val He His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg 
1185 1190 1195 

tga 3597 

<210> 2 
<2U> 1198 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 2 

Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly 
1 5 10 15 

Lys Ala Asp Ser Arg Arg He Leu Gly Ser Pro Phe Met Ala Ser Ser 
20 25 30 

Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg 
35 40 45 

Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr 
50 55 60 

Met Leu Gin Ala Gin Asn Pro Thr Leu Asp Asp Phe Gin Gin Leu Glu 
65 70 75 80 

Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp He Val Arg Ser 
85 90 95 

Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn 
100 105 110 

Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp lie Pro Glu 
115 120 125 

Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val 
130 135 140 

Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gin Arg Trp Val Leu Gly 
145 150 155 160 

Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr 
165 170 * 175 

Thr Val Leu Arg Asp Ala Tyr Gin Arg Asn Ala Glu Gly Leu Val Ala 
180 185 190 

Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gin Asp Leu Leu Gin 
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195 200 205 

Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly 
210 215 220 . * 

Leu Asp Leu Pro Leu He Val Ser Val Thr Val Glu Thr Thr Gly Thr 
225 230 235 240 

Met Leu Leu Gly Ser Glu He Gly Ala Ala Leu Thr Ala Leu Glu Pro 
245 250 ( 255 

Leu Gly lie Asp Met He Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu 
260 265 270 

Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg He Pro Leu 
275 280 285 

Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala 
290 295 300 

His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe 
305 310 315 320 

Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr 
325 330 * 335 

Pro Glu His Leu Arg Gin Val Val Glu Arg Val Arg Asp Thr Ala Pro 
340 345 350 

Thr Ala Arg Asp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gin 
355 360 365 

Thr Val Pro Phe Arg Gin Asp Thr Ser Tyr Leu Ala He Gly Glu Arg 
370 375 380 

Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly 
385 390 395 400 

Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gin He Arg Glu Gly 
405 410 415 

Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val 
420 425 * 430 

Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Ser Thr Leu 
435 440 445 

Pro lie Val Leu Asp Ser Thr Glu Val Asp Val lie Arg Ala Gly Leu 
450 455 460 

Glu Lys Leu Gly Gly Arg Ala Val He Asn Ser Val Asn Tyr Glu Asp 
465 470 475 480 

Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg 
485 490 ** 495 

Glu His Gly Ala Ala Leu lie Ala Leu Thr lie Asp Glu Val Gly Gin 
500 505 510 

Ala Arg Thr Ala Glu Lys Lys Val Glu He Ala Glu Arg Leu lie Asp 
515 520 525 
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Asp Leu Thr Gly Asn Trp Gly He His Glu Ser Asp He Leu Val Asp 
530 535 540 

Cys Leu Thr Phe Thr He Cys Thr Gly Gin Glu Glu Ser Arg Lys Asp 
545 550 555 560 

Gly Leu Ala Thr He Glu Gly He Arg Glu Leu Lys Arg Arg His Pro 
565 570 575 

Asp Val Gin Thr Thr Leu Gly Leu Ser Asn He Ser Phe Gly Leu Asn 
580 585 590 

Pro Ala Ala Arg He Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val 
595 600 605 

Lys Ala Gly Leu Asp Ser Ala He Val His Ala Ser Lys He Leu Pro 
610 615 620 

He Ala Arg Phe Asp Glu Glu Gin Val Thr Thr Ala Leu Asp Leu He 
625 630 635 640 

Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gin Lys Leu Met Gin 
645 650 655 

Leu Phe Glu Gly Ala Thr Ala Lys Ser Leu Lys Ala Ser Lys Ala Glu 
660 665 670 

Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg He lie 
675 680 685 

Asp Gly Glu Lys Asn Gly Leu Glu Gin Asp Leu Asp Glu Ala Leu Arg 
690 695 700 

Glu Arg Pro Ala Leu Glu He Val Asn Asp Thr Leu Leu Asp Gly Met 
705 710 715 720 

Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
725 730 735 

Val Leu Gin Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu 
740 745 750 

Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr He Val Leu 
755 760 765 

Ala Thr Val Arg Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp 
770 775 780 

He He Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly He Lys 
78 5 790 795 800 

Gin Pro Val Ser Ala He Leu Glu Ala Ala Asp Glu His Arg Ala Asp 
805 810 815 

Val He Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val He Met Lys 
820 825 830 

Glu Asn Leu Glu Glu Leu Asn Gin Arg Lys Leu Ala Ala Asp Tyr Pro 
835 840 845 

Val He Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gin Asp 
850 855 860 
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Leu His Glu lie Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe 
8« 870 875 880 

Glii Gly Leu Arg Leu Met Asp Ala Leu lie Gly lie Lys Arg Gly Val 
885 890 895 

Pro Gly Ala Lys Leu Pro Glu Leu Lys Gin Arg Arg Val Arg Ala Ala 
900 905 910 

Thr Val Glu lie Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp 
915 920 925 

Val Ala Thr Asp Asn Pro Val Pro Thr Pro Pro Phe Arg Gly Thr Arg 
930 935 940 

Val Val Lys Gly lie Gin Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu 
945 950 955 960 

Gly Ala Leu Phe Lys Gly Gin Trp Gly Leu Lys Gin Ala Arg Thr Gly 
965 970 975 

Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg 
980 985 990 

Leu Arg Gly Leu Leu Asp Arg Leu Gin Thr Asp Asn Leu Leu Glu Ala 
995 1000 1005 

Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu 
1010 1015 1020 

He Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro 
1025 1030 1035 1040 

Arg Gin Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro 
1045 1050 1055 

Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gin Val Val Thr Val 
1060 1065 1070 

Gly Ser Arg lie Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala 
1075 1080 1085 

Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gin Leu Ala Glu 
1090 1095 1100 

Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe 
H05 1110 ins H20 

Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr 
1125 H30 H35 

Arg Gly Ala Arg Phe Ser Leu Gly Tyr Gly Ala Cys Pro Asp Leu Glu 
1140 H45 1150 

Asp Arg Ala Lys He Ala Ala Leu Leu Glu Pro Glu Arg He Gly Val 
H55 H60 H65 

His Leu Ser Glu Glu Phe Gin Leu His Pro Glu Gin Ser Thr Asp Ala 
1170 1175 1180 

He Val He His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg 
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1185 H90 H95 

<210> 3 

<211> 3537 

<212> DNA 

<213> Anabaena sp. 

<220> 
<221> CDS 
<222> (1)..(3534) 
<223> RAN03790 

<400> 3 

atg act cat cct ttc ctg aaa cgc ctg cac agt ccg gaa ctt ccg gtt 48 
Met Thr His Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val 
1 5 10 is 



ate gtc ttc gac ggt gca atg gga act aac eta caa acc caa aac etc 
He Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Thr Gin Asn Leu 
20 25 30 



130 135 



140 



96 



acg get gag gat ttc ggc ggt gtg cag tat gaa ggt tgt aac gaa tac 144 
Thr Ala Glu Asp Phe Gly Gly Val Gin Tyr Glu Gly Cys Asn Glu Tyr 
35 40 45 

eta gtc cac acc aaa ccc gaa get gtc gee aag gtt cac cgc gac ttt 192 
Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe 
50 55 eo 

etc get gtg ggt gca gat gtc ate gaa acc gac act ttc ggt gcg aca 240 
Leu Ala Val Gly Ala Asp Val He Glu Thr Asp Thr Phe Gly Ala Thr 
65 70 75 80 

tec att gtt ttg gcg gaa tat gac tta gca gac caa aca tat tac ctg 288 
Ser He Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Thr Tyr Tyr Leu 
85 90 95 

aac aag aaa gec gee gaa ctg gcg aaa agt gtc get get gaa ttt tec 336 
Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser 
10 ° 105 no 

aca cca gat aaa ccc egg ttt gtt get ggt tec ate ggc ccc aca acc 384 
Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser lie Gly Pro Thr Thr 
115 120 125 

aaa ctt ccc acc ttg gga cat ate gac ttt gac act etc aaa act tgc 
Lys Leu Pro Thr Leu Gly His lie Asp Phe Asp Thr Leu Lys Thr Cys 



432 



tt !? C ? a 9 ? a 933 9C9 Ct9 tta 9at 99* 99C gtg gat tta ctt 480 

Phe Ala Glu Gin Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu 
145 150 155 160 

ttg gtg gag act tgt caa gat gtg ctg caa ate aaa gcg gcg ctg aat 528 
Leu Val Glu Thr Cys Gin Asp Val Leu Gin lie Lys Ala Ala Leu Asn 
165 170 17S 

2? 9 «* If" 9 f a 9t ° tCt " C aag a9a 999 9 aa C 9 C at « ccc ttg atg 576 
Gly lie Glu Glu Val Phe Gly Lys Arg Gly Glu Arg lie Pro Leu Met 
180 185 " 190 
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gtg tec gtg aca atg gaa age atg ggg aca atg ttg gtc ggt tec gaa 624 
Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu 
155 200 205 

ate aac gee gtc ctg aca att tta gaa cct ttc cca att gac att etc 
lie Asn Ala Val Leu Thr lie Leu Glu Pro Phe Pro lie Asp Ije Leu 
210 215 



672 



220 



720 



ggt ctg aac tgt gec aca ggc cca gac ttg atg aaa cca cat att aaa 
Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His He Lys 
225 230 235 240 . 

tat ttg get gaa cat teg ccg ttt gtg gtt tct tgt att cct aac gcg 768 
Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 2S0 * 255 

ggt tta cca gaa aac gtt ggt ggt caa gca cat tat cgc tta aca cca 816 
Gly Leu Pro Glu Asn Val Gly Gly Gin Ala His Tyr Arg Leu Thr Pro 
260 265 270 

atg gaa tta cgc atg gcg ttg atg cac ttt gtt gaa gat ttg ggt gtc 864 
Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly Val 
275 280 285 

caa gtg ate ggg ggt tgc tgt ggg aca cgt cca gaa cac att caa caa 912 
Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Glu His He Gin Gin 
290 295 300 

tta gca gaa att gec aag gat tta aag cca aag gtg aga cag cca agt 960 
Leu Ala Glu He Ala Lys Asp Leu Lys Pro Lys Val Arg Gin Pro Ser 
305 310 315 320 

tta gaa cct gcg get gca tea ata tat agt act caa ccc tac gaa caa 1008 
Leu Glu Pro Ala Ala Ala Ser lie Tyr Ser Thr Gin Pro Tyr Glu Gin 
325 330 335 

gat aat tct ttc ttg att gtg ggt gaa cgc etc aac gec agt ggt tec 1056 
Asp Asn Ser Phe Leu He Val Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

aag aaa tgc cgt gat ttg ctg aat gcg gaa gat tgg gac gga ttg gta 1104 
Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val 
355 360 365 

tea atg gcg cga teg caa gtc aag gaa ggc gca cat ate ctt gat gtc 1152 
Ser Met Ala Arg Ser Gin Val Lys Glu Gly Ala His He Leu Asp Val 
370 375 380 

tit f St I** 9 ? a °" 93C 99t 9t9 099 9at at9 cac *** eta 1200 
Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu 

385 3 *> 395 400 

gtt tec cgc att gtg aat aat gtt aca etc ccc tta atg etc gac tec 1248 
Val Ser Arg He Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 4io 415 

ace gaa tgg gaa aag atg gag gcg ggt tta aag gtg get ggt ggt aag 1296 
Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
4 20 425 430 

tgt ttg ctg aac tec acc aac tac gaa gat ggg gaa cca cgt ttc tta 1344 
Cys Leu Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Pro Arg Phe Leu 



( 
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435 440 445 

aaa gtg ttg gag ttg gcg aag aaa tat ggc gcg ggt gtt gtt att ggc 1392 
Lys Val Leu Glu Leu Ala Lys Lys Tyr Gly Ala Gly Val Val lie Gly 
450 455 460 

aca att gac gaa gaa ggg atg gcg egg aca gec gag aaa aag ttt caa 1440 
Thr He Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gin 
465 470 475 480 

att gec cag cgt gec tat cgt caa teg g.ta gaa tat ggg att ccc ccc 1488 
He Ala Gin Arg Ala Tyr Arg Gin Ser Val Glu Tyr Gly He Pro Pro 
485 490 495 

aca gaa ata ttc ttt gat acc tta get tta cca att tct acc ggg att 1536 
Thr Glu He Phe Phe Asp Thr Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

gaa gaa gac egg gaa aat ggc aag gcg aca att gaa tea att age cgt 1584 
Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr He Glu Ser He Ser Arg 
515 520 525 

ate cgt aaa gaa ttg cca ggg tgt cat gtt att tta ggc gtg tea aat 1632 
He Arg Lys Glu Leu Pro Gly Cys His Val He Leu Gly Val Ser Asn 
530 535 540 

ata tec ttt ggc tta aat tea gec teg egg atg gtc tta aac tec gtg 1680 
He Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val 
545 550 555 560 

ttt etc cat gaa gca atg act get ggc atg gat gcg gcg ate gtc agt 1728 

Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala He Val Ser 

565 570 575 

'■i 

get age aag att eta cca ctg teg aag att gaa gag cgt cat caa gaa 1776 

Ala Ser Lys He Leu Pro Leu Ser Lys He Glu Glu Arg His Gin Glu 
580 585 590 

gtc tgc cgc cag tta att tat gac cag cgt aaa ttt gag ggt gat ate 1824 
Val Cys Arg Gin Leu He Tyr Asp Gin Arg Lys Phe Glu Gly Asp He 
595 600 605 

tgc ate tat gac ccc tta aca gaa eta act aaa ttg ttt gag gga gtc 1872 
Cys He Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val 
610 615 620 

acc acc aaa cgt aac aaa ggc gtt gat gaa age tta ccc ate gaa gaa 1920 
Thr Thr Lys Arg Asn Lys Gly Val Asp Glu Ser Leu Pro He Glu Glu 
625 630 635 640 

cga etc aag cgt cac att ate gac ggc gaa cgc att ggt tta gaa gcg 1968 
Arg Leu Lys Arg His He He Asp Gly Glu Arg He Gly Leu Glu Ala 
645 650 655 

caa ctg aca aaa gee tta gaa caa tat cca ccc eta gaa att ate aac 2016 
Gin Leu Thr Lys Ala Leu Glu Gin Tyr Pro Pro Leu Glu He He Asn 
660 665 670 

act ttc eta eta gat ggg atg aaa gta gtc ggg gaa ttg ttc ggt tea 2064 
Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 
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gga caa atg cag eta cct ttc gtt tta cag tea gec gaa acc atg aaa 2112 
Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Glu Thr Met Lys 
690 695 700 

geef gcg gta gec tac eta gaa ccg ttc atg gaa aaa teg gaa' agt ggc 2160 
Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly 
705 710 715 



720 



820 825 



830 



2208 



aac aat gee aaa ggt aaa gta att att gee acc gtg aaa ggc gat gtt 
Asn Asn Ala Lys Gly Lys Val lie lie, Ala Thr Val Lys Gly Ab P Val 
725 730 735 

cac gac att ggt aaa aac eta gta gac att ate ttg tec aac aac ggc 2256 
His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

tac aag gta att aac ctg gga att aaa cag ccg gtg gaa aat ate ate 2304 
Tyr Lys Val He Asn Leu Gly He Lys Gin Pro Val Glu Asn He He 
755 760 765 

gag get tac aac caa cac aaa get gat tgt att gec atg agt ggc ttg 2352 
Glu Ala Tyr Asn Gin His Lys Ala Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

ctg gta aaa tec acc gca ttc atg aaa gaa aat ttg gag gtc ttc aac 2400 
Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

gaa aaa ggc att aat gtt cct gta att tta ggt ggt gcg gca tta acc 2448 
Glu Lys Gly He Asn Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

ccg aaa ttc gtg cat aaa gat tgc caa aat acc tac aaa ggt aaa gtc 2496 
Pro Lys Phe Val His Lys A B p Cys Gin Asn Thr Tyr Lys Gly Lys Val 



att tat ggc aaa gat get ttc tea gac ctg cat ttc atg gat aaa tta 2544 
He Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

atg cca gee aaa gec act ggc aaa tgg gac aat tec tta gga ttc ttg 2592 
Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu 
850 855 860 

gat gaa gta gaa acc gag gaa aca gaa cct acc aat cac aaa tec cca 2640 
Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro 
865 870 875 880 

ate ccc agt ccc caa tec cca gtc ccc agt ccc cag tec cca gtc cct 2688 
He Pro Ser Pro Gin Ser Pro Val Pro Ser Pro Gin Ser Pro Val Pro 
885 890 895 



ata gac acc cga cgt tec gaa get gta gee ata gac att ccc cgt ccc 2736 
He Asp Thr Arg Arg Ser Glu Ala Val Ala He Asp He Pro Arg Pro 
500 905 910 

aca cca cca ttc tgg gga acg caa tta tta cag cct age gat att tec 2784 

Thr Pro Pro Phe Trp Gly Thr Gin Leu Leu Gin Pro Ser Asp He Ser 
915 920 925 

tta gag gaa ata ttc tgg cac atg gat ttg caa gee ttg att gcg gga 2832 
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Leu Glu Glu He Phe Trp His Met Asp Leu Gin Ala Leu He Ala Gly 

930 935 940 

caa tgg caa ttc cgc aaa ccc aaa gaa caa tea aag gaa gaa tat caa 

Gin Trp Gin Phe Arg Lys Pro Lys Glu Gin Ser Lys Glu Glu Tyr Gin 

945 950 955 960 

get ttc ttg aat gag aaa gtg tat cca gtt eta gaa act tgg aaa cag 

Ala Phe Leu Asn Glu Lys Val Tyr Pro Val Leu Glu Thr Trp Lys Gin 

965 970 975 



ttt cct tgt caa tct gag ggt aat act tta tat gtt tac gaa aca aac 
Phe Pro Cys Gin Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn 
995 1000 1005 



2880 



2928 



cgc ate att gca gaa aac ttg tta cat ccc cag gta att tat ggg tat 2976 
Arg He He Ala Glu Asn Leu Leu His Pro Gin Val He Tyr Gly Tyr 
980 985 99Q 



3024 



age cca aat gee aca gaa ate act cag ttt gaa ttc ccc cga caa aag 3072 
Ser Pro Asn Ala Thr Glu He Thr Gin Phe Glu Phe Pro Arg Gin Lys 
1010 1015 1020 

tea tea aaa cga tta tgt att gee gat ttc ttt gca ccg aaa gat tea 3120 
Ser Ser Lys Arg Leu Cys He Ala Asp Phe Phe Ala Pro Lys Asp Ser 
1025 1030 1035 1040 

gga ate att gat gtc ttc ccc atg cag gcg gtg act gta ggc gaa att 3168 
Gly He He Asp Val Phe Pro Met Gin Ala Val Thr Val Gly Glu He 
1045 1050 1055 

get aca gag ttc gcg caa aaa ttg ttt gca aac aat caa tac act gat 3216 
Ala Thr Glu Phe Ala Gin Lys Leu Phe Ala Asn Asn Gin Tyr Thr Asp 
1060 1065 1070 

tat ctg tat ttt cac ggt ttg gcg gtg caa gta gca gaa gec ttg gee 3264 
Tyr Leu Tyr Phe His Gly Leu Ala Val Gin Val Ala Glu Ala Leu Ala 
1075 1080 1085 

gag tgg aca cac gec aga ate cgc cgt gag tta ggg ttc ggt get gaa 3312 
Glu Trp Thr His Ala Arg He Arg Arg Glu Leu Gly Phe Gly Ala Glu 
1090 1095 iioo 

gaa ccg gat aat ate egg gat att ttg gca caa cgc tat cag ggt tec 3360 
Glu Pro Asp Asn He Arg Asp He Leu Ala Gin Arg Tyr Gin Gly Ser 
1105 mo 1115 U20 

egg tat agt ttt ggc tac cca get tgt ccc aat att caa gac cag ttt 3408 
Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn He Gin Asp Gin Phe 
H25 H30 H35 

aag cag ctg gat ttg ttg gag act age aga att aac tta tac atg gat 3456 
Lys Gin Leu Asp Leu Leu Glu Thr Ser Arg He Asn Leu Tyr Met Asp 
ll 40 1145 H50 

gaa agt gag caa ctt tat cca gaa cag tct acg acg gcg att att act 3504 
Glu Ser Glu Gin Leu Tyr Pro Glu Gin Ser Thr Thr Ala He He Thr 
H55 1160 lies 

tat cac cca gta get aag tac ttc acc gcg taa 3537 
Tyr His Pro Val Ala Lys Tyr Phe Thr Ala 
1170 1175 
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<210> 4 

<211> 1178 

<212> PRT 

<213> Anabaena sp. 

<400> 4 

Met Thr Hie Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val 
1 5 io 15 

He Val Phe Asp Gly Ala Met Gly Thr ABn Leu Gin Thr Gin Asn Leu 
20 25 30 

Thr Ala Glu Asp Phe Gly Gly Val Gin Tyr Glu Gly Cys Asn Glu Tyr 
35 40 45 

Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe 
50 55 60 

Leu Ala Val Gly Ala Asp Val He Glu Thr Asp Thr Phe Gly Ala Thr 
65 70 75 80 

Ser He Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Thr Tyr Tyr Leu 
85 90 * 95 

Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser 
100 105 110 

Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr 
115 120 125 

Lys Leu Pro Thr Leu Gly His He Asp Phe Asp Thr Leu Lys Thr Cys 
130 135 140 

Phe Ala Glu Gin Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu 
145 150 155 160 

Leu Val Glu Thr Cys Gin Asp Val Leu Gin He Lys Ala Ala Leu Asn 
165 170 175 

Gly He Glu Glu Val Phe Gly Lys Arg Gly Glu Arg He Pro Leu Met 
180 185 190 

Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu 
155 200 205 

He Asn Ala Val Leu Thr He Leu Glu Pro Phe Pro He Asp He Leu 
210 215 220 

Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His He Lys 
225 230 235 240 

Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
24 5 250 255 

Gly Leu Pro Glu Asn Val Gly Gly Gin Ala His Tyr Arg Leu Thr Pro 
260 265 270 

Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly Val 
275 280 285 
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Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Glu His He Gin Gin 
290 295 300 

Leu Ala Glu He Ala Lys Asp Leu Lys Pro Lys Val Arg Gin Pro Ser 
305 310 315 320 

Leu Glu Pro Ala Ala Ala Ser He Tyr Ser Thr Gin Pro Tyr Glu Gin 
325 330 335 

Asp Asn Ser Phe Leu He Val Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val 
355 360 365 

Ser Met Ala Arg Ser Gin Val Lys Glu Gly Ala His He Leu Asp Val 
370 375 380 

Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu 
385 390 395 400 

Val Ser Arg He Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
420 425 430 

CyB Leu Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Pro Arg Phe Leu 
435 440 445 

Lys Val Leu Glu Leu Ala Lys Lys Tyr Gly Ala Gly Val Val He Gly 
450 455 460 

Thr He Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gin 
465 470 475 480 

lie Ala Gin Arg Ala Tyr Arg Gin Ser Val Glu Tyr Gly He Pro Pro 
485 490 495 

Thr Glu He Phe Phe Asp Thr Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr He Glu Ser He Ser Arg 
515 520 525 

He Arg Lys Glu Leu Pro Gly Cys His Val He Leu Gly Val Ser Asn 
530 535 540 

He Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val 
545 550 555 560 

Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala He Val Ser 
565 570 575 

Ala Ser Lys He Leu Pro Leu Ser Lys He Glu Glu Arg His Gin Glu 
580 585 590 

Val Cys Arg Gin Leu He Tyr Asp Gin Arg Lys Phe Glu Gly Asp He 
595 600 60S 

Cys He Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val 
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610 615 620 

Thr Thr Lys Arg Asn Lys Gly Val Asp Glu Ser Leu Pro He Glu Glu 
€25 630 635 640 

Arg Leu Lys Arg His He He Asp Gly Glu Arg He Gly Leu Glu Ala 
645 650 655 

Gin Leu Thr Lys Ala Leu Glu Gin Tyr Pro Pro Leu Glu He He Asn 
660 665 670 

Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Glu Thr Met Lys 
690 695 . 700 

Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly 
705 710 715 720 

Asn Asn Ala Lys Gly Lys Val He He Ala Thr Val Lys Gly Asp Val 
725 730 735 

His Asp lie Gly Lys Asn Leu Val Asp lie lie Leu Ser Asn Asn Gly 
740 745 750 

Tyr Lys Val lie Asn Leu Gly lie Lys Gin Pro Val Glu Asn lie He 
755 760 765 

Glu Ala Tyr Asn Gin His Lys Ala Asp Cys lie Ala Met Ser Gly Leu 
770 775 780 

Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

Glu Lys Gly lie Asn Val Pro Val lie Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

Pro Lys Phe Val His Lys Asp Cys Gin Asn Thr Tyr Lys Gly Lys Val 
820 825 830 

lie Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu 
850 855 860 

Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro 
865 870 875 ' 860 

lie Pro Ser Pro Gin Ser Pro Val Pro Ser Pro Gin Ser Pro Val Pro 
885 890 895 

He Asp Thr Arg Arg Ser Glu Ala Val Ala lie Asp He Pro Arg Pro 
900 905 910 

Thr Pro Pro Phe Trp Gly Thr Gin Leu Leu Gin Pro Ser Asp He Ser 
315 920 925 

Leu Glu Glu He Phe Trp His Met Asp Leu Gin Ala Leu lie Ala Gly 
930 935 940 



( 
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Gin Trp Gin Phe Arg Lys Pro Lys Glu Gin Ser Lys Glu Glu Tyr Gin 
945 950 955 960 

Ala Phe Leu Asn Glu Lys Val Tyr Pro Val Leu Glu Thr Trp Lys Gin 
965 970 975 

Arg He He Ala Glu Asn Leu Leu His Pro Gin Val He TVr Glv Tvr 
980 985 990 

Phe Pro Cys Gin Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn 
995 1000 1005 

Ser Pro Asn Ala Thr Glu He Thr Gin Phe Glu Phe Pro Arg Gin Lys 
1010 1015 1020 

Ser Ser Lys Arg Leu Cys He Ala Asp Phe Phe Ala Pro Lys Asp Ser 
1025 1030 1035 1040 

Gly He He Asp Val Phe Pro Met Gin Ala Val Thr Val Gly Glu lie 
10 *5 1050 1055 

Ala Thr Glu Phe Ala Gin Lys Leu Phe Ala Asn Asn Gin Tyr Thr Asp 
1060 1065 1070 

Tyr Leu Tyr Phe His Gly Leu Ala Val Gin Val Ala Glu Ala Leu Ala 
1075 1080 1085 

Glu Trp Thr His Ala Arg He Arg Arg Glu Leu Gly Phe Gly Ala Glu 
10*0 1095 iioo 

Glu Pro Asp Asn He Arg Asp He Leu Ala Gin Arg Tyr Gin Gly Ser 
1105 HI© 1115 1120 

Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn He Gin Asp Gin Phe 
H25 H30 H35 

Lys Gin Leu Asp Leu Leu Glu Thr Ser Arg He Asn Leu Tyr Met Asp 
1140 H45 iiso 

Glu Ser Glu Gin Leu Tyr Pro Glu Gin Ser Thr Thr Ala He He Thr 
I 155 H60 H65 

Tyr His Pro Val Ala Lys Tyr Phe Thr Ala 
1170 U75 



<210> 5 
<211> 3588 
<212> DNA 

<213> Synechocystis sp. 

<220> 
<221> CDS 
<222> (1) . . (3585) 
<223> RCY35965 

<400> 5 

atg aaa agt get ttt tta gac cgt 
Met Lys Ser Ala Phe Leu Asp Arg 
1 5 



ate cac agt ccc gat cgc ccg gta 48 
He His Ser Pro Asp Arg Pro Val 
10 i 5 



gtc ttt gac ggg get atg ggt aca aac ctg cag gta cag aac eta 96 
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Leu Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Val Gin Asn Leu 
20 25 30 

acg gcg gcg gat ttt ggt ggg gcg gaa tac gaa ggt tgc. aat gaa tat 144 
Thr Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr 
35 40 45 

tta gtc cat acc aag cca gag gcc gtg get acg gtg cat cgt get ttt 192 
Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe 
50 55 60 

tac gaa gcg ggg gcc gat gtc gtg gaa acg gat act ttt ggg gga acg 240 
Tyr Glu Ala Gly Ala Asp Val Val Glu Thr Asp Thr Phe Gly Gly Thr 
6 5 70 75 80 

ccc ctg gtg ctg gcg gag tac gat tta gca gac caa agt tat tac tta 288 
Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Ser Tyr Tyr Leu 
85 90 95 

aat aaa gca gcg gcg gag ttg gcc aag gcg gta gca gcg gaa ttt tct 336 
Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser 
100 105 110 

acc cca gaa aag cct cga ttc gtg gcc ggc tec atg gga cca ggc acc 384 
Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr 
115 120 125 

aag eta ccc acc eta ggt cat gtg gac tac gat agt etc aag gat gcc 432 
Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys' Asp Ala 
130 135 140 

tat gtg gtt cag gtg egg ggt tta tac gat ggc gga gtg gat tta ttg 480 
Tyr Val Val Gin Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu 
14 5 150 155 160 

eta gtg gaa acc tgc cag gat gtg ctg caa att aaa gcg gcc ttg aac 526 
Leu Val Glu Thr Cys Gin Asp Val Leu Gin He Lys Ala Ala Leu Asn 
165 170 175 

gcc att gaa cag gtc ttt gcc gaa aaa ggc gat cgc eta ccg ttg atg 576 
Ala He Glu Gin Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met 
180 185 * " 190 

gtg tea gta acc atg gaa acc atg ggg acc atg ctg gtg ggt acg gag 624 
Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu 
195 200 205 

atg gcg gcg gcc ctg gcc att ttg gag ccc tat ccc ate gat att ttg 672 
Met Ala Ala Ala Leu Ala He Leu Glu Pro Tyr Pro He Asp He Leu 
210 215 220 

ggg eta aac tgc gcc acc ggg cca gat ttg atg aag gaa cac gtt aaa 720 
Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys 
225 230 235 240 

tat ctt tec gaa cat tec ccc ttt gtg gtg tec tgt att ccc aat get 768 
Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 250 255 

ggt ttg cca gaa aac gtt ggc ggt caa get ttt tat cgc etc acc ccg 816 
Gly Leu Pro Glu Asn Val Gly Gly Gin Ala Phe Tyr Arg Leu Thr Pro 
260 265 270 
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atg gaa ctg caa atg tec ctg atg cac ttc ate gaa gac ctg gga gta 864 
Met Glu Leu Gin Met Ser Leu Met His Phe lie Glu Asp Leu Gly Val 
275 280 285 

cag gta att ggt ggt tgt tgt ggc act aga ccc gat cac ate aag gee 912 
Gin Val lie Gly Gly Cys Cys Gly Thr Arg Pro Asp His He Lys Ala 
290 295 300 



ctg gcg gat att gec aag gat etc cag ccc aaa caa cgc caa cct cac 
Leu Ala Asp He Ala Lys Asp Leu Gin Pro Lys Gin Arg Gin Pro His 
305 310 315 320 



960 



tac gaa ccc age gec get tec att tat tec acc caa ace tac gec caa 1008 
Tyr Glu Pro Ser Ala Ala Ser He Tyr Ser Thr Gin Thr Tyr Ala Gin 
325 330 335 

gaa aat tct ttt tta ate att ggc gaa egg etc aat gee agt ggc teg 1056 
Glu Asn Ser Phe Leu He He Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

aaa aaa tgt cga gat ctg etc aat get gaa gat tgg gac age eta gtt 1104 
Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val 
355 360 365 

tec ctg get aaa tec caa gtc aag gaa gga gec caa ate ctt gac gtc 1152 
Ser Leu Ala Lys Ser Gin Val Lys Glu Gly Ala Gin He Leu Asp Val 
370 375 300 

aac gtg gat tac gtt ggt cga gat ggg gta agg gac atg aaa gaa tta 1200 
Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys Glu Leu 
385 390 395 400 

get tec cga eta gtc aat aat gtc acc ctg ccg ttg atg ttg gac tec 124 8 
Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

acc gaa tgg caa aaa atg gag gcg ggt tta aaa gtt gca ggg gga aaa 1296 
Thr Glu Trp Gin Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
420 425 430 

tgt att etc aat tec acc aac tac gaa gac ggg gaa gaa egg ttt tat 1344 
Cys He Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr 
435 440 445 

aaa gtg tta gaa att gec aaa gaa tat gga get ggt att gtc att ggc 1392 
Lys Val Leu Glu He Ala Lys Glu Tyr Gly Ala Gly He Val He Gly 
450 455 460 

acc ate gat gaa gat ggc atg gga cgc act gca gat aaa aaa ttt gag 1440 
Thr He Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu 
465 470 475 480 

att gee aaa egg gec tac gaa gcg gcg ate gec ttt ggc att ccg gee I486 
He Ala Lys Arg Ala Tyr Glu Ala Ala He Ala Phe Gly He Pro Ala 
485 490 495 

aca gaa att ttc ttt gat cct tta get ctg cct att tec acc ggc att 1536 
Thr Glu He Phe Phe Asp Pro Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

gaa gaa gac agg gag aac ggt aaa gee acc gtg gat get ate cgc aga 1584 
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Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala He Arg Arg 
515 520 525 

att cgc cag gaa ttg ccc gat tgt cat att ttg ttg ggg gtt ,tct aac 1632 
He Arg Gin Glu Leu Pro Asp Cys His He Leu Leu Gly Val Ser Asn 
530 535 540 

gtt tec ttt ggc ttg aat ccc gec get cgc cag gta etc aat tec ate 1680 
Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gin Val Leu Asn Ser He 
545 550 555 560 

ttt etc cac gaa tgt atg cag gtg ggc atg gat gcg gee att gtc agt 1728 
Phe Leu His Glu Cys Met Gin Val Gly Met Asp Ala Ala He Val Ser 
565 570 575 

gec aat aag att tta ccc ctg gca aaa att gac cca gaa caa caa caa 1776 
Ala Asn Lys He Leu Pro Leu Ala Lys He Asp Pro Glu Gin Gin Gin 
580 585 590 

gtc tgt eta gat tta ate tat gac cgc egg gaa ttt gaa gga gag cgc 1824 
Val Cys Leu Asp Leu He Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg 
595 600 60S 

tgt aca tat gac ccg tta acc aaa etc ace act tta ttt gaa ggt aaa 1872 
Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly Lys 
610 615 620 

acc acc aaa egg gat aaa tec ggt gat gee aat tta ccg gtg, gaa gaa 1920 
Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu 
625 630 635 640 

aga tta aaa cgc cac ate att gat ggg gaa aga ttg ggc tta gaa gag 1968 
Arg Leu Lys Arg His He He Asp Gly Glu Arg Leu Gly Leu Glu Olu 
645 650 655 

gee etc aat gaa get tta aaa ctt tac get ccc tta gat ate att aac 2016 
Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp He He Asn 
660 665 670 

ate tat ttg ttg gat ggc atg aaa gtg gtg ggg gaa eta ttt ggt tec 2064 
He Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

ggg caa atg cag ttg ccc ttt gtg ttg cag teg gec caa acc atg aaa 2112 
Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Gin Thr Met Lys 
690 695 700 

gcg gcg gtg get ttt tta gaa ccc cat atg gat aag gat gat tec gee 2160 
Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala 
705 710 715 720 

gac aat get aag ggt act ttt tta att gee act gtt aag ggg gat gtc 2208 
Asp Asn Ala Lys Gly Thr Phe Leu He Ala Thr Val Lys Gly Asp Val 
725 730 735 

cat gat att ggc aaa aac tta gtg gat att ate ctt tec aac aat ggc 2256 
His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

tat cga gtg gtc aac eta ggc att aaa cag cca gtg gaa aat att ate 2304 
Tyr Arg Val Val Asn Leu Gly He Lys Gin Pro Val Glu Asn He He 
755 760 765 



( 
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gaa gcc tac aaa aaa cac agg ccc gat tgc att gcc atg agt ggt ttg 2352 
Glu Ala Tyr Lys Lys His Arg Pro Asp Cys lie Ala Met Ser Gly Leu 
770 775 780 

ttg gtc aaa tea act get ttt atg aag gaa aat tta gaa gtt ttc aac 2400 
Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

caa gag ggc att act gtt ccc gtc att ctt ggt ggt get get tta acg 2448 
Gin Glu Gly He Thr Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

cct aaa ttt gtt cac cag gac tgc caa aat acc tac aaa ggc caa gta 2496 
Pro Lys Phe Val His Gin Asp Cys Gin Asn Thr Tyr Lys Gly Gin Val 
820 825 * 830 

att tac ggc aaa gat gcg ttc gcc gat tta cat ttc atg gat aag eta 2544 
He Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

atg ccc get aaa aat age cac aat tgg gat gat ttc cag ggc ttt tta 2592 
Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gin Gly Phe Leu 
850 855 860 

ggg gaa tat gca acg gaa aat ggc cat aat gtg acc act gat gat ggt 2640 
Gly Glu Tyr Ala Thr Glu Asn Gly His Asn Val Thr Thr Asp Asp Gly 
865 870 875 880 

get aaa act aat ttt ggc att gaa gaa gaa aaa tta att gac get agt 2686 
Ala Lys Thr Asn Phe Gly He Glu Glu Glu Lys Leu He Asp Ala Ser 
885 890 * 895 

gag cag tct agg gag ccg gag gta att gat act gtt cgt tct gaa gcg 2736 
Glu Gin Ser Arg Glu Pro Glu Val He Asp Thr Val Arg Ser Glu Ala 
900 905 910 

gtg gac cct gat eta gaa aga cct gtg cca cct ttt tgg ggc act aaa 2784 
Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys 
915 920 925 

att ttg caa tec agt gat att tec etc gat gaa gtc ttc cct tta ctg 2832 
He Leu Gin Ser Ser Asp He Ser Leu Asp Glu Val Phe Pro Leu Leu 
930 935 " 940 

gat tta caa gca tta ttt gtt ggt cag tgg cag ttt cgc aaa cct agg 2880 
Asp Leu Gin Ala Leu Phe Val Gly Gin Trp Gin Phe Arg Lys Pro Arg 
945 950 955 960 

gag caa tec agg gaa gaa tac gag caa ttc eta gcg gaa aaa gtt cat 2928 
Glu Gin Ser Arg Glu Glu Tyr Glu Gin Phe Leu Ala Glu Lys Val His 
965 970 975 

ccc att ttg get gag tgg aaa ggt aag gtc atg gca gaa aat tta etc 2976 
Pro He Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu 
980 985 990 

cat cct acg gtg gtt tat ggt tat ttt ccc tgt caa tec cag ggc aat 3024 
His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gin Ser Gin Gly Asn 
995 1000 1005 

acc ttg tta att tat gac cca gaa ttg gtc age caa aat aat ggc caa 3072 
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Thr Leu Leu lie Tyr Asp Pro Glu Leu Val Ser Gin Asn Asn Gly Gin 
1010 1015 1020 

att ccc cca gac gca acg gcg ate gec aaa ttt gag ttt ccc, egg caa 3120 
He* Pro Pro Asp Ala Thr Ala lie Ala Lys Phe Glu Phe Pro Arg Gin 
1025 1030 1035 • 1040 

aaa tea ggg egg egg etc tgt att gcg gac ttt ttt get tea aaa gaa 3168 
Lys Ser Gly Arg Arg Leu Cys He Ala Asp Phe Phe Ala Ser Lys Glu 
1045 1050 ,'1055 

teg ggg att act gat gtt ttt cct ttg caa gcg gtt aca gtg ggg gaa 3216 
Ser Gly He Thr Asp Val Phe Pro Leu Gin Ala Val Thr Val Gly Glu 
1060 1065 1070 

ate gcg acg gaa tat gca agg aaa ctt ttt get ggc gat aat tac acc 3264 
He Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr 
1075 1080 1085 

gat tac etc tac ttc cac ggc atg gcg gtg cag atg gcg gaa get tta 3312 
Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gin Met Ala Glu Ala Leu 
1090 1095 1100 

gcg gag tgg act cac caa egg ata cgt cag gaa ttg ggc ttt ggc cat 3360 
Ala Glu Trp Thr His Gin Arg He Arg Gin Glu Leu Gly Phe Gly His 
1105 1110 1115 1120 

tta gat cca gat aac ate cgt gat ctt etc cag caa cgt tac caa ggt 3408 
Leu Asp Pro Asp Asn He Arg Asp Leu Leu Gin Gin Arg Tyr Gin Gly 
1125 1130 H35 

tec cgc tac agt ttt ggt tat ccc get tgt ccc aac atg cag gat caa 3456 
Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gin Asp Gin 
1140 1145 1150 

tac aca caa tta gaa ttg tta caa acc gaa cga att ggc ttg tat atg 3504 
Tyr Thr Gin Leu Glu Leu Leu Gin Thr Glu Arg lie Gly Leu Tyr Met 
1155 1160 *" H65 

gat gaa agt gaa cag gtt tat cca gaa caa tec acc acg gcg att att 3552 
Asp Glu Ser Glu Gin Val Tyr Pro Glu Gin Ser Thr Thr Ala He He 
1170 1175 1180 

tec tat cat cct gcg get aaa tat ttc age get taa 3 588 

Ser Tyr His Pro Ala Ala Lys Tyr Phe Ser Ala 
1185 1190 1195 



<210> 6 
<211> 1195 
<212> PRT 

<213> Synechocystis sp. 
<400> 6 

Met Lys Ser Ala Phe Leu Asp Arg He His Ser Pro Asp Arg Pro Val 
15 10 15 

Leu Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Val Gin Asn Leu 
20 25 30 



Thr Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr 
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35 40 45 

Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe 
50 55 60 

Tyr Glu Ala Gly Ala Abp Val Val Glu Thr Asp Thr Phe Gly Gly Thr 
*5 70 75 80 

Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Ser Tyr Tyr Leu 
85 90 95 

Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser 
100 105 no 

Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr 
115 120 125 

Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys Asp Ala 
130 135 140 

Tyr Val Val Gin Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu 
"5 150 155 160 

Leu Val Glu Thr Cys Gin Asp Val Leu Gin lie Lys Ala Ala Leu Asn 
165 170 175 

Ala He Glu Gin Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met 
180 185 190 

Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu 
195 200 205 

Met Ala Ala Ala Leu Ala He Leu Glu Pro Tyr Pro He Asp He Leu 
210 215 220 

Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys 
225 230 235 240 

Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 250 255 

Gly Leu Pro Glu Asn Val Gly Gly Gin Ala Phe Tyr Arg Leu Thr Pro 
260 265 270 

Met Glu Leu Gin Met Ser Leu Met His Phe lie Glu Asp Leu Gly Val 
275 280 285 

Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Asp His He Lys Ala 
290 29S 300 

Leu Ala Asp He Ala Lys Asp Leu Gin Pro Lys Gin Arg Gin Pro His 
305 310 315 ~ 320 

Tyr Glu Pro Ser Ala Ala Ser He Tyr Ser Thr Gin Thr Tyr Ala Gin 
325 330 335 

Glu Asn Ser Phe Leu He He Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val 
355 360 365 
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Ser Leu Ala Lys Ser Gin Val Lys Glu Gly Ala Gin He Leu Asp Val 
370 375 -380 

Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys, Glu Leu 
385 390 395 400 

Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

Thr Glu Trp Gin Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
420 425 430 

Cys He Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr 
435 440 445 

Lys Val Leu Glu lie Ala Lys Glu Tyr Gly Ala Gly He Val He Gly 
450 455 460 

Thr He Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu 
465 470 475 " " 480 

He Ala Lys Arg Ala Tyr Glu Ala Ala He Ala Phe Gly He Pro Ala 
485 490 495 

Thr Glu He Phe Phe Asp Pro Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala He Arg Arg 
515 520 525 

He Arg Gin Glu Leu Pro Asp Cys His He Leu Leu Gly Val Ser Asn 
530 535 540 

Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gin Val Leu Asn Ser He 
545 550 555 560 

Phe Leu His Glu Cys Met Gin Val Gly Met Asp Ala Ala He Val Ser 
565 570 * 575 

Ala Asn Lys lie Leu Pro Leu Ala Lys lie Asp Pro Glu Gin Gin Gin 
580 585 590 

Val Cys Leu Asp Leu He Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg 
595 600 605 

Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly LyB 
610 615 620 

Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu 
625 630 635 640 

Arg Leu Lys Arg His He He Asp Gly Glu Arg Leu Gly Leu Glu Glu 
645 650 655 

Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp lie He Asn 
660 665 670 

He Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Gin Thr Met Lys 
690 695 700 



WO 03/087386 PCT/EP03/04010 

26 

Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala 
705 710 715 720 

Asp Asn Ala Lys Gly Thr Phe Leu He Ala Thr Val Lys Gly Asp Val 
? 25 730 735 

His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

Tyr Arg Val Val Asn Leu Gly He Lys Gin Pro Val Glu Asn He He 
755 760 765 

Glu Ala Tyr Lys Lys His Arg Pro Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

Gin Glu Gly lie Thr Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

Pro Lys Phe Val His Gin Asp Cys Gin Asn Thr Tyr Lys Gly Gin Val 

820 825 ... 830 

He Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gin Gly Phe Leu 
850 855 860 

Gly Glu Tyr Ala Thr Glu Asn Gly HIb Asn Val Thr Thr Asp Asp Gly 
865 870 875 880 

Ala Lys Thr Asn Phe Gly He Glu Glu Glu Lys Leu He Asp Ala Ser 
885 890 895 

Glu Gin Ser Arg Glu Pro Glu Val He Asp Thr Val Arg Ser Glu Ala 
900 905 910 

Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys 
915 920 925 

He Leu Gin Ser Ser Asp He Ser Leu Asp Glu Val Phe Pro Leu Leu 
930 935 940 

Asp Leu Gin Ala Leu Phe Val Gly Gin Trp Gin Phe Arg Lys Pro Arg 
945 950 955 960 

Glu Gin Ser Arg Glu Glu Tyr Glu Gin Phe Leu Ala Glu Lys Val His 
965 970 975 

Pro He Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu 
980 985 990 

His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gin Ser Gin Gly Asn 
995 1000 1005 

Thr Leu Leu He Tyr Asp Pro Glu Leu Val Ser Gin Asn Asn Gly Gin 
1010 1015 1020 

He Pro Pro Asp Ala Thr Ala He Ala Lys Phe Glu Phe Pro Arg Gin 
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1025 1030 1035 1040 

Lys Ser Gly Arg Arg Leu Cys He Ala Asp Phe Phe Ala Ser Lys Glu 
1045 1050 1055 

Ser Gly He Thr Asp Val Phe Pro Leu Gin Ala Val Thr Val Gly Glu 
1060 1065 1070 

He Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr 
1075 1080 1085 

Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gin Met Ala Glu Ala Leu 
1090 1095 uoo 

Ala Glu Trp Thr His Gin Arg He Arg Gin Glu Leu Gly Phe Gly His 
H05 1110 ins 1120 

Leu Asp Pro Asp Asn He Arg Asp Leu Leu Gin Gin Arg Tyr Gin Gly 
1125 1130 1135 

Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gin Asp Gin 
1140 1145 H50 

Tyr Thr Gin Leu Glu Leu Leu Gin Thr Glu Arg He Gly Leu Tyr Met 
1155 1160 H65 

Asp Glu Ser Glu Gin Val Tyr Pro Glu Gin Ser Thr Thr Ala lie He 
1170 1175 1180 

Ser Tyr His Pro Ala Ala Lys Tyr Phe Ser Ala 
1185 1190 H95 



<210> 7 
<211> 3561 
<212> DNA 

<213> Prochlorococcus marinus 

<220> 
<221> CDS 
<222> (1) . . (3558) 
<223> RCK00830 

<400> 7 

atg gtt tea ttt aga aat tat tta aat aga gat gat aaa cca att att 48 

Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro He He 
1 5 10 is 

att ttc gat ggt ggg aca ggt act tct ttt caa aat tta aat tta tea 96 
He Phe Asp Gly Gly Thr Gly Thr Ser Phe Gin Asn Leu Asn Leu Ser 
20 25 30 

tea cat gat ttt ggt gga gat gat tta gag ggt tgc aat gaa aac tta 144 
Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu 
35 40 45 

gtt eta tec tct cct aat act gtt gaa caa gta cat aat tea ttt ctt 192 
Val Leu Ser Ser Pro Asn Thr Val Glu Gin Val His Asn Ser Phe Leu 
50 55 60 

gaa gca ggt tgt cat gta att gaa acc aat aca ttt ggt get tea tct 240 
Glu Ala Gly Cys His Val He Glu Thr Asn Thr Phe Gly Ala Ser Ser 
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65 70 75 80 

att gtt tta gac gaa tat agt att tct aat aaa get tat gaa ate aat 288 
He Val Leu Asp Glu Tyr Ser He Ser Asn Lys Ala Tyr Glu He Asn 
85 90 95 

aaa aaa gca get cag ata get aaa aaa tgt gca aat tta ttt tea tct 336 
Lys Lys Ala Ala Gin He Ala Lys Lys Cys Ala Asn Leu Phe Ser Ser 
100 105 no 

att aat act cct aga ttt gtc get gga tea att ggg cca act aca aaa 384 
He Asn Thr Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr Lys 
115 120 125 

tta cca aca tta ggt cat att agt ttt gat aag ctt aaa gat tea tat 432 
Leu Pro Thr Leu Gly His He Ser Phe Asp Lys Leu Lys Asp Ser Tyr 
130 135 140 

gaa gaa caa ata aat ggt eta att gac gga ggt att gac ctt eta ttg 480 
Glu Glu Gin He Asn Gly Leu He Asp Gly Gly He Asp Leu Leu Leu 
145 150 155 160 

att gaa aca tgc caa gat gtt tta caa ata aaa tea gca tta tct get 528 
He Glu Thr Cys Gin Asp Val Leu Gin He Lys Ser Ala Leu Ser Ala 
165 170 175 

tct caa gaa gtt att aaa aac agg aat att gaa tta cca ata atg ata 576 
Ser Gin Glu Val He Lys Asn Arg Asn He Glu Leu Pro He Met He 
180 185 190 

tec ata act atg gaa ace aca gga acg atg ctt gtc ggg tea gat ata 624 
Ser He Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp He 
195 200 205 

get tct gca tta aca ata tta gag cca tac aat att gat att ctg gga 672 
Ala Ser Ala Leu Thr He Leu Glu Pro Tyr Asn He Asp He Leu Gly 
210 215 220 

ctg aat tgt gca act ggt cca gtt caa atg aaa gaa cat att aag tat 720 
Leu Asn Cys Ala Thr Gly Pro Val Gin Met Lys Glu His He Lys Tyr 
225 230 235 240 

tta get gaa aat tea cct ttt gca att agt tgt ata cct aat gca gga 768 
Leu Ala Glu Asn Ser Pro Phe Ala He Ser Cys He Pro Asn Ala Gly 
245 250 255 

tta cct gaa aat ata gga ggt gtt get cac tat aaa tta act cca ttg 816 
Leu Pro Glu Asn He Gly Gly Val Ala His Tyr Lys Leu Thr Pro Leu 
260 265 270 

gag ttg aaa atg cag tta atg aac ttt att tat gat ttt aac gta caa 864 
Glu Leu Lys Met Gin Leu Met Asn Phe He Tyr Asp Phe Asn Val Gin 
275 280 285 

ctt att ggc gga tgt tgt ggt act act cct gaa cat ate aag cat tta 912 
Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Lys His Leu 
290 295 300 

tea tea ate att gag gaa ata gtt gat aaa aaa ata aat aaa aga ctt 960 
Ser Ser He He Glu Glu He Val Asp Lys Lys He Asn Lys Arg Leu 
305 310 315 * 320 
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cct act gta aaa aca aat ttt gtt cct tea gca get tct ata tat aac 1008 
Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser lie Tyr Asn 
325 330 335 

gca gtt cca tat aaa caa gat aac tea ata tta ata gtt gga gaa cgt 1056 
Ala Val Pro Tyr Lye Gin Asp Asn Ser He Leu He Val Gly Glu Arg 
340 345 350 

tta aat get agt gga tea aaa aaa gta agg gaa tta eta aat gaa gat 1104 
Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu Asn Glu Asp 
355 360 365 

gat tgg gac ggc ctg eta tea att get aaa caa cag caa aaa gaa aat 1152 
Asp Trp Asp Gly Leu Leu Ser He Ala Lys Gin Gin Gin Lys Glu Asn 
370 375 380 

get cac ata eta gat gtc aat gtt gat tat gta gga aga gat gga gtt 1200 
Ala His He Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val 
385 390 395 400 

aaa gat atg aaa gaa att acc tea aga tta gtt aca aat ata aat ctt 1248 
Lys Asp Met Lys Glu He Thr Ser Arg Leu Val Thr Asn He Asn Leu 
405 410 415 

cca tta atg ata gat tea aca gaa gca gat aaa atg gaa agt gga tta 1296 
Pro Leu Met He Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu 
420 425 430 

aag act gta gga gga aaa tgc att ata aat tea aca aac tac gaa gat 1344 
Lys Thr Val Gly Gly Lys Cys He He Asn Ser Thr Asn Tyr Glu Asp 
435 440 445 

gga gat gac aga ttt aat cag gtc tta aga ctt gca tta gat tat ggt 1392 
Gly Asp Asp Arg Phe Asn Gin Val Leu Arg Leu Ala Leu Asp Tyr Gly 
450 455 460 

get gga ata gta att gga act att gat gaa gat gga atg gca aga aca 1440 
Ala Gly He Val He Gly Thr He Asp Glu Asp Gly Met Ala Arg Thr 
465 470 475 480 

tea cag aaa aaa tat gac att gca aaa aga gca tta att aaa act aga 1488 
Ser Gin Lys Lys Tyr Asp He Ala Lys Arg Ala Leu He Lys Thr Arg 
485 490 495 

tea agt ggc etc get gat tat gag ata ttt ttt gat cct eta gca ttg 1536 
Ser Ser Gly Leu Ala Asp Tyr Glu He Phe Phe Asp Pro Leu Ala Leu 
500 50S 510 

cca ata tct act gga att gaa gaa gat aga tta aat get aaa gca act 1584 
Pro He Ser Thr Gly He Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr 
515 520 525 

att gaa get ata tea aaa ata aga aaa age ttt cca gat att cat att 1632 
He Glu Ala He Ser Lys He Arg Lys Ser Phe Pro Asp He His He 
530 535 540 

att tta ggg ata tct aat att agt ttc ggg ctt tea cca tta tea aga 1680 
He Leu Gly He Ser Asn He Ser Phe Gly Leu Ser Pro Leu Ser Arg 
545 550 555 560 

att aat eta aat tea ata ttt etc gat gaa tgt ata aag gca gga tta 1728 
He Asn Leu Asn Ser He Phe Leu Asp Glu Cys He Lys Ala Gly Leu 
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565 570 575 

gat tea gcg att att gca cca aat aaa ata ttg cct ctt tea aaa ata 1776 
Asp ser Ala He He Ala Pro Asn Lys He Leu Pro Leu Ser Lys He 
580 585 590 

tct gcg gaa aca aaa aaa tta tgt tta gat tta att tat gac aga aga 1824 
Ser Ala Glu Thr Lys Lys Leu Cys Leu Asp Leu He Tyr Asp Arg Arg 
595 600 605 

aat ttc gaa aat gaa ata tgt ata tat gat cca tta gtt gaa eta aca 1872 
Asn Phe Glu Asn Glu He Cys He Tyr Asp Pro Leu Val Glu Leu Thr 
610 615 620 

aaa gca ttc caa gat ata aca ate agt gac ttt aaa aaa gga tct act 1920 
Lys Ala Phe Gin Asp He Thr He Ser Asp Phe Lys Lys Gly Ser Thr 
625 630 635 640 

tea aac aaa aac etc ace tta gaa gaa aaa ctt aaa aac cat att gta 1968 
Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His He Val 
645 650 655 

gat ggg gaa aaa ata ggt tta gaa gaa caa tta aat aat gcg ctt aaa 2016 
Asp Gly Glu Lys He Gly Leu Glu Glu Gin Leu Asn Asn Ala Leu Lys 
660 665 670 

aag tac aaa cca ctt gaa ata att aat act tat tta tta gat gga atg 2064 
Lys Tyr Lys Pro Leu Glu He He Asn Thr Tyr Leu Leu Asp Gly Met 
675 680 685 

aaa gta gtc ggt gaa eta ttt gga tec ggc caa atg caa tta cct ttt 2112 
Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
690 695 700 

gta ttg caa tea gcg gaa aca atg aaa ttt get gtt tea gtg ctt gaa 2160 
Val Leu Gin Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu 
705 710 715 720 

cct cat atg gaa aca gta gat gaa aaa ata tct aac gga aaa tta eta 2208 
Pro His Met Glu Thr Val Asp Glu Lys He Ser Asn Gly Lys Leu Leu 
725 730 735 

ata gca act gtt aaa gga gat gtt cat gat ata ggt aaa aat tta gtt 2256 
He Ala Thr Val Lys Gly Asp Val HiB Asp He Gly Lys Asn Leu Val 
740 745 750 

gat ata att etc tea aat aat ggt ttt gat gta ate aac ctt gga att 2304 
Asp He He Leu Ser Asn Asn Gly Phe Asp Val He Asn Leu Gly He 
755 760 765 

aag caa gat gtt tea gcg att att gat gca caa aaa aaa cat aaa gca 2352 
Lys Gin Asp Val Ser Ala He He Asp Ala Gin Lys Lys His Lys Ala 
770 775 780 

gac tgt att get atg agt ggt tta ctt gtt aaa tct aca gca ttt atg 2400 
Asp Cys He Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met 
785 7 90 795 800 

aag gat aat tta gaa gca ttt aac aat get gaa att aat gtt cca gtt 2448 
Lys Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu He Asn Val Pro Val 
805 810 815 
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att ctt gga ggt gca gca tta act cca aaa ttt gtg aat gaa gat tgt 2496 
He Leu Gly Gly Ala Ala Leu Thr Pro Lye Phe Val Asn Glu Asp Cys 
820 825 830 

agt cag ata tat aaa ggt aaa att ttg tat ggg aaa gat get ttt aca 2544 
Ser Gin He Tyr Lye Gly Lys He Leu Tyr Gly Lys Asp Ala Phe Thr 
835 840 845 

gat tta caa ttt atg aat gac tat atg gat agt aaa aag aag ggc aat 2592 
Asp Leu Gin Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn 
850 855 860 

tgg tct aat gaa aat ggt ttt act aat act gat gat att caa att aaa 2640 
Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp He Gin He Lys 
865 870 875 880 

tta get tec cca agg tct tec get aaa gat aaa aat tta aat aaa aat 2688 
Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn 
885 890 895 

ttt gaa aaa acc aaa agt att caa tta att gag aat ttt aat aga tct 2736 
Phe Glu Lys Thr Lys Ser He Gin Leu He Glu Asn Phe Asn Arg Ser 
900 905 910 

aat ttt gta gag gaa gag gaa cct ata aag get cca ttt ttg gga act 2784 
Asn Phe Val Glu Glu Glu Glu Pro He Lys Ala Pro Phe Leu Gly Thr 
915 920 925 

aga gtt ctt caa gat att gaa ata gac ttt gac aaa eta att ttt tat 2832 
Arg Val Leu Gin Asp He Glu He Asp Phe Asp Lys Leu He Phe Tyr 
930 935 940 

eta gat aaa aaa gca tta ttt agt ggt caa tgg caa att aaa aaa aat 2880 
Leu Asp Lys Lys Ala Leu Phe Ser Gly Gin Trp Gin He Lys Lye Asn 
945 950 955 960 

aaa ggt caa tea gta gaa gaa tac aat aat tat tta gat tea tat gca 2928 
Lys Gly Gin ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala 
965 970 975 

aat cca tta ctt gaa aaa tgg att aat att att tta gat aaa ggc tta 2976 
Asn Pro Leu Leu Glu Lys Trp He Asn He He Leu Asp LyB Gly Leu 
980 985 990 

att tea cca aaa gta gtc tat ggc tac ttc cgt tgc ggg agg aat gat 3024 
He Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp 
995 1000 1005 

aat agt att tat etc ttt gat aat gta tea aat aaa aga att tct gaa 3072 
Asn Ser He Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg He Ser Glu 
1010 1015 1020 

ttt aac ttt cct aga caa aaa teg gga aat aat ctt tgt att gca gat 3120 
Phe Asn Phe Pro Arg Gin Lys Ser Gly Asn Asn Leu Cys He Ala Asp 
1025 1030 1035 1040 

ttt tac tgt gat ctt aaa aat aat gat cca gta gat ata ttt cca atg 316B 
Phe Tyr Cys Asp Leu Lys Asn Asn Asp Pro Val Asp He Phe Pro Met 
1045 1050 1055 

caa gca gta aca atg ggg gaa ata get age gaa tat tec caa gaa tta 3216 
Gin Ala Val Thr Met Gly Glu He Ala Ser Glu Tyr Ser Gin Glu Leu 
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1060 1065 1070 

ttt aaa get gat aaa tat agt gat tat tta ata ttt cat ggt tta acc 3264 
Phe Lys Ala Asp Lys Tyr Ser Asp Tyr Leu He Phe His Gly Leu Thr 
1075 1080 1085 

gtt caa tta gca gaa get ctt gca gaa tat gtt cat tea ata gta aga 3312 
Val Gin Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser He Val Arg 
1090 1095 1100 

att gaa tgc gga ttt aaa tea tat gag cca aac aat aac cgt gat ata 3360 
He Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Asn Asn Arg Asp He 
1105 1110 1H5 * H20 

tta get caa aaa tat aga gga get aga tac tea ttt ggt tat cca get 3408 
Leu Ala Gin Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala 
1125 1130 1135 

tgt cct aaa gtt tct gat tea aat ata cag tta tea tta ttg gat aca 3456 
Cys Pro Lys Val Ser Asp Ser Asn He Gin Leu Ser Leu Leu Asp Thr 
1140 1145 H50 

aaa agg att aat tta aca atg gat gaa tea gag caa tta cat cct gaa 3504 
Lys Arg He Asn Leu Thr Met Asp Glu Ser Glu Gin Leu His Pro Glu 
1155 1160 1165 

caa agt act act get ata att tea ctt cat tea aaa gca aaa tat ttt 3552 
Gin Ser Thr Thr Ala He He Ser Leu His Ser Lys Ala Lys Tyr Phe 
1170 1175 1180 

agt gee taa 3 561 

Ser Ala 

1185 



<210> 6 
<211> 1186 
<212> PRT 

<213> Prochlorococcus marinus 
<400> 8 

Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro He He 
1 5 10 15 

He Phe Asp Gly Gly Thr Gly Thr Ser Phe Gin Asn Leu Asn Leu Ser 
20 25 30 

Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu 
35 40 45 

Val Leu Ser Ser Pro Asn Thr Val Glu Gin Val His Asn Ser Phe Leu 
50 55 60 

Glu Ala Gly Cys His Val He Glu Thr Asn Thr Phe Gly Ala Ser Ser 
65 70 75 80 

He Val Leu Asp Glu Tyr Ser He Ser Asn Lys Ala Tyr Glu He Asn 
85 90 95 

Lys Lys Ala Ala Gin He Ala Lys Lys Cys Ala Asn Leu Phe Ser Ser 
100 105 no 
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lie Asn Thr Pro Arg Phe Val Ala Gly Ser He Qly Pro Thr Thr Lys 
115 120 125 

Leu Pro Thr Leu Gly Hie He Ser Phe Asp Lys Leu Lys Asp, Ser Tyr 
130 135 140 

Glu Glu Gin He Asn Gly Leu He Asp Gly Gly He Asp Leu Leu Leu 
145 150 155 160 

He Glu Thr Cys Gin Asp Val Leu Gin, lie Lys Ser Ala Leu Ser Ala 
165 170 175 

Ser Gin Glu Val He Lys Asn Arg Asn He Glu Leu Pro He Met He 
180 185 190 

Ser lie Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp He 
195 200 205 

Ala Ser Ala Leu Thr lie Leu Glu Pro Tyr Asn He Asp He Leu Gly 
210 215 220 

Leu Asn Cys Ala Thr Gly Pro Val Gin Met Lys Glu His He Lys Tyr 
225 230 235 240 

Leu Ala Glu Asn Ser Pro Phe Ala lie Ser Cys He Pro Asn Ala Gly 
245 250 255 

Leu Pro Glu Asn lie Gly Gly Val Ala His Tyr Lys Leu Thr, Pro Leu 
260 265 270 

Glu Leu Lys Met Gin Leu Met Asn Phe lie Tyr Asp Phe Asn Val Gin 
275 280 285 

Leu lie Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Lys His Leu 
290 295 300 

Ser Ser He lie Glu Glu lie Val Asp Lys Lys lie Asn Lys Arg Leu 
305 310 315 320 

Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser He Tyr Asn 
325 330 335 

Ala Val Pro Tyr Lys Gin Asp Asn Ser lie Leu lie Val Gly Glu Arg 
340 345 350 

Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu Asn Glu Asp 
355 360 365 

Asp Trp Abp Gly Leu Leu Ser He Ala Lys Gin Gin Gin Lys Glu Asn 
370 375 380 

Ala His lie Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val 
385 390 395 ~ * 400 

Lys Asp Met Lys Glu He Thr Ser Arg Leu Val Thr Asn lie Asn Leu 
405 410 415 

Pro Leu Met lie Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu 
420 425 430 

Lys Thr Val Gly Gly Lys Cys He He Asn Ser Thr Asn Tyr Glu Asp 
435 440 445 
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Gly Asp Asp Arg Phe Asn Gin Val Leu Arg Leu Ala Leu Asp Tyr Gly 
450 455 460 

Ala Gly He Val He Gly Thr He Asp Glu Asp Gly Met Ala Arg Thr 
465 470 475 480 

Ser Gin Lys Lys Tyr Asp He Ala Lys Arg Ala Leu He Lys Thr Arg 
485 490 495 

Ser Ser Gly Leu Ala Asp Tyr Glu He Phe Phe Asp Pro Leu Ala Leu 
500 505 510 

Pro He Ser Thr Gly He Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr 
515 520 525 

He Glu Ala He Ser Lys He Arg Lys Ser Phe Pro Asp He His He 
530 535 540 

He Leu Gly He Ser Asn He Ser Phe Gly Leu Ser Pro Leu Ser Arg 
545 550 555 5 60 

He Asn Leu Asn Ser He Phe Leu Asp Glu Cys He Lys Ala Gly Leu 
565 570 575 

Asp Ser Ala He He Ala Pro Asn Lys He Leu Pro Leu Ser Lys He 
580 585 590 

Ser Ala Glu Thr Lys Lys Leu Cys Leu Asp Leu He Tyr Asp Arg Arg 
595 600 605 

Asn Phe Glu Asn Glu He Cys He Tyr Asp Pro Leu Val Glu Leu Thr 
610 615 620 

Lys Ala Phe Gin Asp He Thr He Ser Asp Phe Lys Lys Gly Ser Thr 
625 630 635 * 640 

Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His He Val 
645 650 655 

Asp Gly Glu Lys He Gly Leu Glu Glu Gin Leu Asn Asn Ala Leu Lys 
660 665 670 

Lys Tyr Lys Pro Leu Glu He He Asn Thr Tyr Leu Leu Asp Gly Met 
675 680 685 

Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
690 695 700 

Val Leu Gin Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu 
705 710 715 720 

Pro His Met Glu Thr Val Asp Glu Lys He Ser Asn Gly Lys Leu Leu 
725 730 735 

He Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn Leu Val 
740 745 750 

Asp He He Leu Ser Asn Asn Gly Phe Asp Val He Asn Leu Gly lie 
755 760 765 

Lys Gin Asp Val Ser Ala He lie Asp Ala Gin Lys Lys His Lys Ala 
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770 775 780 

Asp Cys He Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met 
785 790 795 , 800 

Lys Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu He Asn Val Pro Val 
805 810 815 

He Leu Gly Gly Ala Ala Leu Thr Pro Lys Phe Val Asn Glu Asp Cys 
820 825 830 

Ser Gin He Tyr Lys Gly Lys He Leu Tyr Gly Lys Asp Ala Phe Thr 
835 B40 845 

Asp Leu Gin Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn 
850 855 860 

Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp He Gin He Lys 
865 870 875 880 

Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn 
885 890 895 

Phe Glu Lys Thr Lys Ser He Gin Leu He Glu Asn Phe Asn Arg Ser 
900 905 910 

Asn Phe Val Glu Glu Glu Glu Pro He Lys Ala Pro Phe Leu Gly Thr 
915 920 925 

Arg Val Leu Gin Asp He Glu He Asp Phe Asp Lys Leu He Phe Tyr 
930 935 940 

Leu Asp Lys Lys Ala Leu Phe Ser Gly Gin Trp Gin He Lys Lys Asn 
9*5 950 955 960 

Lys Gly Gin Ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala 
965 970 975 

Asn Pro Leu Leu Glu Lys Trp He Asn He He Leu Asp Lys Gly Leu 
980 985 990 

He Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp 
995 1000 1005 

Asn Ser He Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg He Ser Glu 
1010 1015 1020 

Phe Asn Phe Pro Arg Gin Lys Ser Gly Asn Asn Leu Cys He Ala Asp 
1025 1030 1035 1040 

Phe Tyr Cys Asp Leu Lys Asn Asn Asp Pro Val Asp He Phe Pro Met 
1045 1050 1055 

Gin Ala Val Thr Met Gly Glu He Ala Ser Glu Tyr Ser Gin Glu Leu 
1060 1065 1070 

Phe Lys Ala Asp Lys Tyr Ser Asp Tyr Leu He Phe His Gly Leu Thr 
1075 1080 1085 

Val Gin Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser He Val Arg 
1090 1095 1100 
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He Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Abii Asn Arg ABp He 
1105 HIO 1115 1120 

Leu Ala Gin Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala 
H25 H30 1135 

Cys Pro Lys Val Ser Asp Ser Asn He Gin Leu Ser Leu Leu Asp Thr 
1140 H45 H50 

LyB Arg He Asn Leu Thr Met Asp Glu Ser Glu Gin Leu His Pro Glu 
H55 H60 H65 

Gin Ser Thr Thr Ala lie lie Ser Leu His Ser Lys Ala Lys Tyr Phe 
H70 1175 H80 

Ser Ala 
1185 



<210> 9 
<211> 3048 
<212> DNA 

<213> Thermus thermophilus 

<220> 

<221> CDS 

<222> (1)..(3045) 

<223> RTT00266 

<400> 9 

atg egg gec tac aag gag gcg gca egg ggg ctt ctt aag ggc ggg gtg 48 
Met Arg Ala Tyr Lys Glu Ala Ala Arg Gly Leu Leu LyB Gly Gly Val 
1 5 io i 5 

gac etc ate etc ttg gag acc gee cag gac ate etc cag gtg cgc tgc 96 
Asp Leu lie Leu Leu Glu Thr Ala Gin Asp lie Leu Gin Val Arg Cys 
20 25 30 

gee gtc ttg gcg gtg egg gag gee atg gee gag gtg ggc egg gag gtg 144 
Ala Val Leu Ala Val Arg Glu Ala Met Ala Glu Val Gly Arg Glu Val 
35 40 45 

ccc etc cag gtc cag gtg acc ttt gag gee acg ggg acg atg etc gtg 192 
Pro Leu Gin Val Gin Val Thr Phe Glu Ala Thr Gly Thr Met Leu Val 
50 55 60 

ggc acg gac gag cag gcg gee ctg gee get ctg gag age etc ccc gtg 240 
Gly Thr ABp Glu Gin Ala Ala Leu Ala Ala Leu Glu Ser Leu Pro Val 
65 70 75 80 

gac gtg gtg ggg atg aac tgc gee acg ggc ccc gac etc atg gac age 288 
Asp Val Val Gly Met Asn Cys Ala Thr Gly Pro Asp Leu Met Asp Ser 
85 90 95 

aag gtg cgc tac ttc gee gag cac age acc cgc ttc gtc tec tgc etc 336 
Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu 
100 105 no 

ccg aac gcg ggc ctg ccc egg aac gag ggg ggg agg gtg gtc tac gac 364 
Pro Asn Ala Gly Leu Pro Arg Asn Glu Gly Gly Arg Val Val Tyr Asp 
H5 120 125 
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etc acc ccc gag gag etc gec aag tgg cac etc aag ttc gtg gee gag 432 
Leu Thr Pro Glu Glu Leu Ala Lys Tip His Leu Lys Phe Val Ala Glu 
130 135 140 

tac ggg gtg aac gec gtg ggg gga tgc tgc ggc acg ggg ccc gag cac 480 
Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His 
145 150 155 160 

ata agg aag gtg gec gag gcg gtg aag ggg etc gec ccg aag cca agg 528 
lie Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg 
165 170 175 

ccc gaa age ttc cct ccc cag gtg gee tec ttg tac cag gcg gtg tec 576 
Pro Glu Ser Phe Pro Pro Gin Val Ala Ser Leu Tyr Gin Ala Val Ser 
180 185 190 

etc aag cag gag gcg age ctt ttc etc gtg ggg gag agg etc aac gee 624 
Leu Lys Gin Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala 
195 200 205 

acg ggg age aag cgc ttc egg gag atg etc ttc gcg aga gac etc gag 672 
Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu 
210 215 220 

ggc ate etc gee etc gee egg gag cag gtg gag gag ggg gec cac gee 720 
Gly lie Leu Ala Leu Ala Arg Glu Gin Val Glu Glu Gly Ala His Ala 
225 230 235 240 

ctg gac etc tec gtg gee tgg acg ggg egg gac gag ctt gag gac etc 768 
Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Abp Leu 
245 250 255 

egg tgg etc ctt ccc cat etc gee acc gee ctt acc gtc ccc gtc atg 816 
Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met 
260 265 270 

gtg gac tec acc tec cct gag gee atg gag etc gec etc aaa tac etc 864 
Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu 
275 280 285 

ccg ggc egg gtc etc ctg aac tec gee aac etc gag gat ggc tta gag 912 
Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu 
290 295 300 

cgc ttt gac egg gtg gec tec ctg gec aag gec cac ggg gcg gec etc 960 
Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu 
305 310 315 320 

gtg gtc etc gec att gac gag aag ggg atg gec aag acc egg gag gag 1008 
Val Val Leu Ala lie Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu 
325 330 335 

aag gtg egg gtg gec ctg agg atg tac gag cgc etc acg gag cac cac 1056 
Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His 
340 345 350 

ggc etc cgc ccc gag gac etc etc ttt gac etc ctt acc ttc ccc ate 1104 
Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro lie 
355 360 365 



acc caa ggg gac gag gag age cgc cct ctg gee aag gag acc etc etc 
Thr Gin Gly Asp Glu Glu Ser Arg Pro Leu Ala Lys Glu Thr Leu Leu 



1152 
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370 375 380 

gcc ata gag gag eta egg gag agg ctt ccc ggg gtg ggc ttc gtc ctt 1200 
Ala lie Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu 
385 390 395 400 

egg gtc tec aac gtc tec ttc ggg etc aag ccc egg gcg agg cgc gtc 1248 
Arg Val Ser Asn Val Ser Phe Gly Leu Lye Pro Arg Ala Arg Arg Val 
405 410 415 

ctg aac tec gtc ttc ctg gac gag gcg agg aaa egg ggc etc acc gcg 1296 
Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lys Arg Gly Leu Thr Ala 
420 425 430 

gcc ate gtg gac gcg ggg aag ate etc ccc ata age cag ate ccc gag 1344 
Ala lie Val Asp Ala Gly Lye lie Leu Pro He Ser Gin He Pro Glu 
435 440 445 

gag gcc tac gcc etc gcc tta gac etc ate tac gac cgc cgc aag gag 1392 
Glu Ala Tyr Ala Leu Ala Leu Asp Leu He Tyr Asp Arg Arg Lys Glu 
450 455 460 

ggc ttt gac ccc etc etc gcc ttc atg gcc tac ttt gag gcc cac aag 1440 
Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys 
465 470 475 480 

gag gac ccg ggg aag agg gag gac gcc ttc ctg gcc ctt ccc ctt ctg 1488 
Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu 
485 490 495 

gag agg etc aag cgc cgc gtg gtg gag ggg agg aag cag ggc etc gag 1536 
Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gin Gly Leu Glu 
500 505 510 

gcc gac ctg gag gag gcc ctg aag gcg ggg cac aag ccc ttg gac etc 1584 
Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu 
515 520 S25 

ate aac ggc ccc etc etc gcg ggg atg aag gag gtg ggg gac etc ttc 1632 
He Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe 
530 535 * 540 

ggg gcg ggg aag atg cag etc ccc ttc gtc etc cag gcc gcc gag gtg 1680 
Gly Ala Gly Lys Met Gin Leu Pro Phe Val Leu Gin Ala Ala Glu Val 
545 550 555 560 

atg aag egg gcg gtg gcc tac etc gag ccc cac atg gag aag aag ggg 1728 
Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly 
565 570 575 

gag ggc aag ggt acc ctg gtc etc gcc acc gtc aag ggg gac gtg cac 1776 
Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His 
580 585 590 

gac ate ggc aag aac ctg gtg gac ate ate etc age aac aac ggc tac 1824 
Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly Tyr 
595 600 605 

c 99 9tg gtg aac ctg ggg ate aag gtg ccc att gag gag ate ctg aag 1872 
Arg Val Val Asn Leu Gly He Lys Val Pro He Glu Glu He Leu Lys 
610 615 620 
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gcc gtg gag gcg cac aag ccc cac gcc gtg ggc atg teg ggc etc ctg 1920 
Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu 
625 630 635 640 

gtg aag age acc ctg gtg atg aag gag aac ctg gag tac atg egg gat 1968 
Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp 
645 650 655 

agg ggc tac acc etc ccc gtg ate ctg ggc ggg gcc gcc etc acc egg 2016 
Arg Gly Tyr Thr Leu Pro Val He Leu Gly Gly Ala Ala Leu. Thr Arg 
660 665 670 

age tac gtg gag gag ctt aag gcc ate tac ccc aac gtc tac tac gcc 2064 
Ser Tyr Val Glu Glu Leu Lys Ala He Tyr Pro Asn Val Tyr Tyr Ala 
675 680 685 

gag gac gcc ttt gag ggc tta agg etc atg gag gag etc acg ggc cac 2112 
Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His 
690 695 700 

gcc cct ccc gag etc acc egg aag gcc cea get agg ccc aag egg gag 2160 
Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu 
705 710 715 720 

gcc ccc aag gtg gcg ccc cgc get egg ccc gtg ggg gag gcc ccc gcc 2208 
Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala 
725 730 735 

gtc ccc egg ccc ccc ttc ttc ggc gtg egg gtg gag gaa ggc ttg gac 2256 
Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp 
740 745 750 

etc gcc acc ate gcc cac tac gtc aac aag etc gcc etc tac egg ggc 2304 
Leu Ala Thr He Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly 
755 760 765 

cag tgg ggc tac age cgc aag ggc ttt ccc ggg agg cgt ggc agg ccc 2352 
Gin Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly Arg Pro 
770 775 780 

tgg tgg age ggg agg egg age ctg tct tec aga ggc tec tea agg agg 2400 
Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg 
785 790 795 800 

cga tgg egg aag ggt ggc ttg aac cca agg tec tct acg get tct tec 2448 
Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser 
805 810 815 

ccg tgg ccc ggg agg gga gga get tct cgt ctt etc ccc aga gac ggg 2496 
Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly 
820 825 830 

gga ggt get gga gcg ctt ccg ctt ccc ccg gca aag ggg egg ggg cct 2544 
Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro 
835 840 845 

gag cct cgt gga eta ctt ccg ccc ccg gtt tgc cgc gcc ttt ggg gga 2592 
Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly 
850 855 860 



cga ggc gga ctg gat gcc caa gga ggc ctt ccg ggc ggg ggc egg gac 
Arg Gly Gly Leu Asp Ala Gin Gly Gly Leu Pro Gly Gly Gly Arg Asp 



2640 
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865 870 875 880 

gtc etc ggg gtc cag etc gtc acc atg ggg gag gee cct tec cga aag 2688 
Val Leu Gly Val Gin Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lys 
885 890 895 

gec cag gec etc ttt gcg tec ggg gee tac cag gac tac etc ttc gtc 2736 
Ala Gin Ala Leu Phe Ala Ser Gly Ala Tyr Gin Asp Tyr Leu Phe Val 
900 905 ~ 910 

cac ggc ttc age gtg gag atg acc gag gee ttg gcg gag tac tgg cac 2784 
His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His 
915 920 925 

aag agg atg egg cag atg tgg ggc ate gec cac aag gac gee acc gag 2832 
LyB Arg Met Arg Gin Met Trp Gly lie Ala His Lys Asp Ala Thr Glu 
930 935 940 

ate cag aag etc ttc cag cag ggc tac cag ggg gee cgc tac tec ttc 2880 
lie Gin Lys Leu Phe Gin Gin Gly Tyr Gin Gly Ala Arg Tyr Ser Phe 
945 950 955 960 

ggc tac ccc gec tgc ccg gac etc gee gac cag gec aag ctg gac egg 2928 
Gly Tyr Pro Ala Cys Pro Asp Leu Ala Asp Gin Ala Lys Leu Asp Arg 
965 970 975 

etc atg ggc ttc cac egg gtg ggg gtg cac etc acg gag aac ttc cag 2976 
Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gin 
980 985 990 

ctg gag ccg gag cac gee acc age gec etc gtg gtc cac cac ccc gag 3024 
Leu Glu Pro Glu Hie Ala Thr Ser Ala Leu Val Val His His Pro Glu 
995 1000 1005 

gec cgc tac ttc age gtg gac tag 3048 
Ala Arg Tyr Phe Ser Val Asp 
1010 1015 



<210> 10 
<211> 1015 
<212> PRT 

<213> Thermus thermophilus 
<400> 10 

Met Arg Ala Tyr Lys Glu Ala Ala Arg Gly Leu Leu Lys Gly Gly Val 
1 5 10 15 

Asp Leu lie Leu Leu Glu Thr Ala Gin Asp He Leu Gin Val Arg Cys 
20 25 30 

Ala Val Leu Ala Val Arg Glu Ala Met Ala Glu Val Gly Arg Glu Val 
35 40 45 

Pro Leu Gin Val Gin Val Thr Phe Glu Ala Thr Gly Thr Met Leu Val 
50 55 60 

Gly Thr Asp Glu Gin Ala Ala Leu Ala Ala Leu Glu Ser Leu Pro Val 
65 70 75 80 

Asp Val Val Gly Met Asn Cys Ala Thr Gly Pro Asp Leu Met Asp Ser 
85 90 95 
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Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu 
100 105 110 

Pro Asn Ala Gly Leu Pro Arg Asn Glu Gly Gly Arg Val Val Tyr Asp 
115 120 125 

Leu Thr Pro Glu Glu Leu Ala Lys Trp His Leu Lys Phe Val Ala Glu 
130 135 140 

Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His 
145 150 155 160 

lie Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg 
165 170 175 

Pro Glu Ser Phe Pro Pro Gin Val Ala Ser Leu Tyr Gin Ala Val Ser 
180 185 190 

Leu Lys Gin Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala 
195 200 205 

Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu 
210 215 220 

Gly He Leu Ala Leu Ala Arg Glu Gin Val Glu Glu Gly Ala His Ala 
225 230 235 * 240 

Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Asp Leu 
245 250 255 

Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met 
260 265 270 

Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu 
275 280 285 

Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu 
290 295 300 

Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu 
305 310 315 320 

Val Val Leu Ala He Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu 
325 330 335 

Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His 
340 345 350 

Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro He 
355 360 365 

Thr Gin Gly Asp Glu Glu Ser Arg Pro Leu Ala Lys Glu Thr Leu Leu 
370 375 380 

Ala He Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu 
385 390 395 " 400 

Arg Val Ser Asn Val Ser Phe Gly Leu Lys Pro Arg Ala Arg Arg Val 
405 410 415 

Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lys Arg Gly Leu Thr Ala 
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420 425 430 

Ala He Val Asp Ala Gly Lys He Leu Pro He Ser Gin He Pro Glu 
435 440 445 

Glu Ala Tyr Ala Leu Ala Leu Asp Leu He Tyr Asp Arg Arg Lys Glu 
450 455 " 460 

Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys 
465 470 475 480 

Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu 
485 490 495 

Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gin Gly Leu Glu 
500 505 510 

Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu 
515 520 525 

He Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe 
530 535 540 

Gly Ala Gly Lys Met Gin Leu Pro Phe Val Leu Gin Ala Ala Glu Val 
545 550 555 560 

Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly 
565 570 575 

Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His 
580 565 590 

Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly Tyr 
595 600 605 

Arg Val Val Asn Leu Gly He Lys Val Pro He Glu Glu He Leu Lys 
610 615 620 

Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu 
625 630 635 640 

Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp 
645 650 655 

Arg Gly Tyr Thr Leu Pro Val He Leu Gly Gly Ala Ala Leu Thr Arg 
660 665 670 

Ser Tyr Val Glu Glu Leu Lys Ala He Tyr Pro Asn Val Tyr Tyr Ala 
675 680 685 

Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His 
690 695 700 

Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu 
705 710 715 720 

Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala 
725 730 735 

Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp 
740 745 " 750 
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Leu Ala Thr lie Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly 
755 760 765 

Gin Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly Arg Pro 
770 775 780 

Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg 
785 790 795 800 

Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser 
805 810 815 

Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly 
820 825 830 

Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro 
835 840 845 

Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly 
850 85S 860 

Arg Gly Gly Leu Asp Ala Gin Gly Gly Leu Pro Gly Gly Gly Arg Asp 
8« 870 875 " 880 

Val Leu Gly Val Gin Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lys 
885 890 895 

Ala Gin Ala Leu Phe Ala Ser Gly Ala Tyr Gin Asp Tyr Leu Phe Val 
900 90S 910 

His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His 
915 920 925 

Lys Arg Met Arg Gin Met Trp Gly lie Ala His Lys Asp Ala Thr Glu 
930 935 940 

He Gin Lys Leu Phe Gin Gin Gly Tyr Gin Gly Ala Arg Tyr Ser Phe 
945 950 955 " 960 

Gly Tyr Pro Ala Cys Pro Asp Leu Ala Asp Gin Ala Lys Leu Asp Arg 
965 970 975 

Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gin 
980 985 990 

Leu Glu Pro Glu His Ala Thr Ser Ala Leu Val Val His His Pro Glu 
995 1000 1005 

Ala Arg Tyr Phe Ser Val Asp 
1010 1015 



<210> 11 
<211> 3441 
<212> DNA 

<213> Bacillus halodurans 

<220> 
<221> CDS 
<222> (1) . . (3438) 
<223> RHD05550 
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<400> 11 

atg act aaa teg ttg ttt gaa caa cag tta gag cga aaa ate gtc ate 48 
Met Thr Lys Ser Leu Phe Glu Gin Gin Leu Glu Arg Lys lie Val He 
1 5 10 15 



ctt gat ggg gcg atg ggg acc atg tta caa gee gcg aat eta acc get 
Leu Asp Gly Ala Met Gly Thr Met Leu Gin Ala Ala Asn Leu Thr Ala 
20 25 30 



96 



gat gac ttt ggc gga gaa gag tat gaa ggg tgt aat gaa tat tta aat 144 
Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn 
35 40 45 

gag acg gee ccc cat gtc gtt gag gac att cat cgc gca tac tta gag 192 
Glu Thr Ala Pro His Val Val Glu Asp He His Arg Ala Tyr Leu Glu 
50 55 60 

gca gga gca gac gtc att gcg acg aac acg ttc ggg gca aca gat ate 240 
Ala Gly Ala Asp Val He Ala Thr Asn Thr Phe Gly Ala Thr Asp He 
65 70 75 80 

gtt ctt gac gat tat gat etc gga tac aaa gca gag gag tta aac ata 288 
Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn He 
85 . 90 95 

tgc gcg gtg aaa ate get aaa cgt gta get gaa gag ttt tec act cca 336 
Cys Ala Val Lys He Ala Lys Arg Val Ala Glu Glu Phe Ser Thr Pro 
100 105 no 

gat tgg cct cga ttc gtt gca ggg gcg atg ggg ccg acg acg aaa tct 384 
Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser 
115 120 125 

ctt tec gtc aca ggg ggc gcg aca ttc gaa caa ctt ate gag tct tat 432 
Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gin Leu He Glu Ser Tyr 
130 135 140 

cgc cag caa get aca ggt eta att aaa ggc ggg gcg gat att tta tta 480 
Arg Gin Gin Ala Thr Gly Leu He Lys Gly Gly Ala Asp He Leu Leu 
145 150 155 160 

etc gaa acg age cag gat atg cga aac gtg aag gcg get tat tta gga 528 
Leu Glu Thr Ser Gin Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly 
165 170 175 

ctg age caa gcg caa aaa gag eta gag gtg aaa ctg cct etc att att 576 
Leu Ser Gin Ala Gin Lys Glu Leu Glu Val Lys Leu Pro Leu He He 
180 185 190 

tct gga acg att gaa ccg atg gga aca acg etc gee ggc caa aac ate 624 
Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Asn He 
195 200 205 

gag gcg ttc tat ttg tea tta gag cat atg aat ccc gtc gtt gtc ggt 672 
Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly 
210 215 220 

etc aac tgc get aca gga cca gaa ttt atg cgc gat cac etc cgt tct 720 
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser 
225 230 235 " 240 

ctt tea gac ctt gcg acc tgc tct gta age tgt tat ccg aat get ggg 768 
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Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cys Tyr Pro Asn Ala Gly 
245 250 255 

tta cct gat gaa gag ggg aac tat cac gaa tec cca gaa tca„tta gca 816 
Leu' Pro Asp Glu Glu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala 
260 265 270 

gec aag etc gca ggt ttt gcg gaa aag ggc tgg ttg aat atg gtt ggt 864 
Ala Lys Leu Ala Gly Phe Ala Glu Lys Gly Trp Leu Asn Met Val Gly 
275 280 285 

ggc tgt tgc ggg acg act cca gac cac att cgt get ctt ttg gac gtt 912 
Gly Cys Cys Gly Thr Thr Pro Asp His He Arg Ala Leu Leu Asp Val 
290 295 300 

atg aag caa ttt gag ccg aga caa cca aaa ggg gat cac ccc cac teg 960 
Met Lys Gin Phe Glu Pro Arg Gin Pro Lys Gly Asp His Pro His Ser 
305 310 315 320 

gtc tea gga att gag cca ctg tta tac gat gac age atg cgt cca eta 1008 
Val Ser Gly He Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu 
325 330 335 

ttt gtc ggt gaa egg aca aac gtc ate ggg tct cgt aaa ttt aaa egg 1056 
Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys Arg 
340 345 350 

ttg ate gaa gaa gaa aaa tat gaa gaa gec tea gaa att gca, aga tec 1104 
Leu He Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu He Ala Arg Ser 
355 360 365 

caa gtg aag aaa ggg gee cac gtt ate gat gtt tgt ctt get gat ccg 1152 
Gin Val Lys Lys Gly Ala His Val He Asp Val Cys Leu Ala Asp Pro 
370 375 380 

gat cgc gat gaa atg gag gac atg gag gaa ttt tta aaa ttc gtg ate 1200 
Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val He 
385 390 395 " 400 

aac aaa gtg aag gta ccg etc atg att gac tec ace gac gaa aag gta 1248 
Asn Lys Val Lys Val Pro Leu Met He Asp Ser Thr Asp Glu Lys Val 
405 410 415 

att gaa caa gcg ctt acg tat tea caa ggg aaa gcg ate att aat teg 1296 
He Glu Gin Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn Ser 
420 425 " 430 

ate aac tta gag gac ggc gaa gaa cgt ttt gaa aaa gtg gtc ccg etc 1344 
He Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu 
435 440 445 

gtc cat aag tat gga gee gcg gtt gtc gtt ggt acg ate gac gaa gaa 1392 
Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu Glu 
450 455 460 

gga atg gcg att acg gca gaa aaa aaa tta gcg gtt gcg aaa cga tea 1440 
Gly Met Ala He Thr Ala Glu Lys Lys Leu Ala Val Ala LyB Arg Ser 
465 470 475 480 

tac gac ctg etc gta aac aaa tac aac att cgt ccg age gat att att 14 88 
Tyr Asp Leu Leu Val Asn Lys Tyr Asn He Arg Pro Ser Asp He He 
485 490 495 
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ttt gat ccg etc gtg ttc cca gta gga aca ggc gat gag caa tac att 1536 
Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr He 
500 505 510 

ggc teg gcg aat gag acg gtg gaa gga att agg agg ate aaa gaa gag 1584 
Gly Ser Ala Asn Glu Thr Val Glu Gly He Arg Arg He Lys Glu Glu 
515 520 525 

etc cct gaa tgt tta acg att ctt gga gtt agt aac gtg teg ttc ggt 1632 
Leu Pro Glu Cys Leu Thr He Leu Gly Val Ser Asn Val Ser Phe Gly 
530 535 540 

ctt ccg cct gtc gga aga gag gtg ctg aac gcg gcg tac tta tac cat 16B0 
Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His 
545 550 555 560 

tgt aca caa get ggc ctt gat tac get ate gtg aac aca gaa aag ctt 1728 
Cys Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys Leu 
565 570 575 

gag cgt tat gec teg att tct gat gaa gaa aaa gaa ttg tea agg aag 1776 
Glu Arg Tyr Ala Ser He Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys 
580 585 590 

etc tta ttt gaa acg aca gat gaa acg etc get gag ttc ace gec ttt 1824 
Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe 
595 600 605 

tat cga ggg aaa aaa gca gag aaa aaa gtg gag act tct aat tta act 1872 
Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr 
610 615 620 

ttg gaa gag egg ttg gca aac tac att gtt gaa ggg tea aag gac gga 1920 
Leu Glu Glu Arg Leu Ala Asn Tyr He Val Glu Gly Ser Lys Asp Gly 
625 630 635 640 

ctg aca gaa gat tta gat aaa gcg etc gcg aaa tat gat gat ccg ctt 1968 
Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu 
645 650 655 

gat ate att aac ggc ccg etc atg aat gga atg gac gaa gtc ggt cgt 2016 
Asp He He Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg 
660 665 670 

ttg ttt aac aat aac gag ctt att gtc get gaa gta ttg caa age get 2064 
Leu Phe Asn ABn Asn Glu Leu He Val Ala Glu Val Leu Gin Ser Ala 
675 680 685 

gag gtt atg aag get tec gtc gee cac ctt gag cca cat atg gaa aag 2112 
Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys 
690 695 700 



aaa gca gac gat cat gga aaa gga aaa 

Lys Ala Asp Asp His Gly Lys Gly Lys 

705 710 

ggc gat gtt cac gat ate ggg aaa aat 

Gly Asp Val His Asp He Gly Lys Asn 
725 

aat aat ggt ttc cgc ate gtg aac eta 



ate att ctt gee acg gtc aag 2160 

He He Leu Ala Thr Val Lys 

715 720 

eta gtg gaa att att ttg age 2208 

Leu Val Glu He He Leu Ser 
730 735 

gga att aaa gtt ace tct aat 2256 
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Asn Asn Gly Phe Arg He Val Asn Leu Gly He Lys Val Thr Ser Asn 
740 745 750 

gag ctg att gaa gcg gtg gcg aga gaa aat cca gat gcg att ggc ttg 2304 
Glu Leu He Glu Ala Val Ala Arg Glu Asn Pro Asp Ala He Gly Leu 
755 760 765 

tea ggg ttg etc gtc aaa tea gca caa caa atg gta ctt acc gec caa 2352 
Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Leu Thr Ala Gin 
770 775 780 

gat ttg aag caa caa caa att tec att ccg att tta gtc gga ggc gca 2400 
Asp Leu Lys Gin Gin Gin He Ser He Pro He Leu Val Gly Gly Ala 
785 790 795 800 

gec ctt acg egg aaa ttt acg aat aca aaa ate get cca gag tat gat 2448 
Ala Leu Thr Arg Lys Phe Thr Asn Thr Lys He Ala Pro Glu Tyr Asp 
805 810 8X5 

ggt etc gtc gtc tac gcg aag gat gcg atg aac ggg tta gag ctt gec 2496 
Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala 
820 825 830 

aat aaa tta atg aaa cct gat gaa cga gaa aag eta gcg gtc tec etc 2544 
Asn Lys Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu 
835 840 845 

cat gaa gcg aag gag cag gcg aac teg agg aca caa atg gga gga ggc 2592 
His Glu Ala Lys Glu Gin Ala Asn Ser Arg Thr Gin Met Gly Gly Gly 
B50 855 860 

gga act gca gtt gcg gta aag ccg act cga tec cat gtt teg aca acg 2640 
Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr 
865 870 875 880 

gtg cct gta gcg gtc cca cct gat gtg aag ccg cac att ttg cgc cac 2688 
Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His He Leu Arg His 
885 890 895 

cat age att gec cat tta gag ccg tat att aac atg cag atg ttg tta 2736 
His Ser He Ala His Leu Glu Pro Tyr He Asn Met Gin Met Leu Leu 
300 905 910 

gga cgt cac tta ggc tta caa ggg aaa gtg age cgc ctg ctt gca gaa 2784 
Gly Arg His Leu Gly Leu Gin Gly Lys Val Ser Arg Leu Leu Ala Glu 
915 920 925 

aaa gac gag aag get ctt gaa tta aaa gaa aaa gtt gat gcg eta etc 2832 
Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu 
930 935 940 



acc agg gtg aaa gag gag cag etc atg gaa gee cat ggc atg tat cag 
Thr Arg Val Lys Glu Glu Gin Leu Met Glu Ala His Gly Met Tyr Gin 
945 950 955 960 



2880 



ttt ttt cct gee cag teg gat ggg gac gat att gtc att tat gat caa 2928 
Phe Phe Pro Ala Gin Ser Asp Gly Asp Asp lie Val lie Tyr Asp Gin 
965 970 975 

acg gga aca aat gaa ate gag cga ttc cat ttt ccg cgt cag aat aag 2976 
Thr Gly Thr Asn Glu He Glu Arg Phe His Phe Pro Arg Gin Asn Lys 
980 985 990 
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gag cct tat ctg tgt ctt gcc gat ttc ctt cgc cca gtt tec agt ggg 3024 
Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly 
995 1000 ~ 1005 

gaa atg gac tat gtt ggc ttc ctt get gta ace gca gga aaa ggc att 3072 
Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly He 
1010 1015 1020 

cgt gaa tta ggg gag cag gcg aaa gag get gga gac tat tta ttc agt 3120 
Arg Glu Leu Gly Glu Gin Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser 
1025 1030 1035 1040 

cac tta ate caa gca aca gcc tta gag atg gcg gaa ggg ttt gcc gag 3168 
His Leu He Gin Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu 
1045 1050 1055 

cgt gtc cat cag etc atg cgt gat aag tgg ggg ttt cct gat teg get 3216 
Arg Val His Gin Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala 
1060 1065 1070 

gac ttt aca atg gaa gag cgt ttc get gca aaa tac cgt ggc ate cgt 3264 
Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Gly He Arg 
1075 1080 1085 

gta teg ttt ggc tac cct gca tgc cct gac ttg gat gac caa gca aag 3312 
Val Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gin Ala Lys 
1090 1095 1100 

ttg ttt aag ctg ttg aag cct gga aag ate gga att gag ttg acg gaa 3360 
Leu Phe Lys Leu Leu Lys Pro Gly Lys He Gly He Glu Leu Thr Glu 
1105 mo ins 1120 

ggg ttt atg atg gag cca gaa gcc tec gtc acc gcg atg gtg ttt gcc 3408 
Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala 
1125 H30 H35 

cat cct gag get cgc tat ttt aat gtt tta tag 3441 
His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1140 H45 



<210> 12 
<211> 1146 
<212> PRT 

<213> Bacillus halodurans 
<400> 12 

Met Thr Lys Ser Leu Phe Glu Gin Gin Leu Glu Arg Lys He Val He 
1 5 10 15 

Leu Asp Gly Ala Met Gly Thr Met Leu Gin Ala Ala Asn Leu Thr Ala 
20 25 30 

Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn 
35 40 45 

Glu Thr Ala Pro His Val Val Glu Asp He His Arg Ala Tyr Leu Glu 
50 55 60 

Ala Gly Ala Asp Val lie Ala Thr Asn Thr Phe Gly Ala Thr Asp He 
65 70 75 80 
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Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn He 
85 90 9S 

Cys Ala Val Lys He Ala Lys Arg Val Ala Glu Glu Phe Ser Thr Pro 
100 105 110 

Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser 
115 120 125 

Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gin Leu He Glu Ser Tyr 
130 135 140 

Arg Gin Gin Ala Thr Gly Leu He Lys Gly Gly Ala Asp He Leu Leu 
"5 150 155 160 

Leu Glu Thr Ser Gin Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly 
165 170 175 

Leu Ser Gin Ala Gin Lys Glu Leu Glu Val Lys Leu Pro Leu He He 
180 185 190 

Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Asn He 
195 200 205 

Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly 
210 215 220 

Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser 
225 230 235 240 

Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cys Tyr Pro Asn Ala Gly 
245 250 255 

Leu Pro Asp Glu Glu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala 
260 265 270 

Ala Lys Leu Ala Gly Phe Ala Glu Lys Gly Trp Leu Asn Met Val Gly 
275 280 285 

Gly Cys Cys Gly Thr Thr Pro Asp His He Arg Ala Leu Leu Asp Val 
290 295 300 

Met Lys Gin Phe Glu Pro Arg Gin Pro Lys Gly Asp His Pro His Ser 
3 °5 310 315 320 

Val Ser Gly He Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu 
325 330 335 

Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys Arg 
340 345 " 350 

Leu He Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu He Ala Arg Ser 
355 360 365 

Gin Val Lys LyB Gly Ala His Val He Asp Val Cys Leu Ala Asp Pro 
370 375 380 

Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val He 
385 390 395 400 

Asn Lys Val Lys Val Pro Leu Met He Asp Ser Thr Asp Glu Lys Val 
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405 410 415 

He Glu Gin Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn Ser 
420 425 . 430, 

He Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu 
435 440 445 

Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu Glu 
450 455 460 

Gly Met Ala He Thr Ala Glu Lys Lys Leu Ala Val Ala Lys Arg Ser 
4fi 5 470 475 4B0 

Tyr Asp Leu Leu Val Asn Lys Tyr Asn He Arg Pro Ser Asp He lie 
485 490 495 

Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr He 
500 505 510 

Gly Ser Ala Asn Glu Thr Val Glu Gly lie Arg Arg He Lys Glu Glu 
515 520 525 

Leu Pro Glu Cys Leu Thr He Leu Gly Val Ser Asn Val Ser Phe Gly 
530 535 540 

Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His 
545 550 555 560 

Cys Thr Gin Ala Gly Leu Asp Tyr Ala lie Val Asn Thr Glu Lys Leu 
565 570 575 

Glu Arg Tyr Ala Ser He Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys 
580 5B5 590 

Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe 
595 600 605 

Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr 
610 615 620 

Leu Glu Glu Arg Leu Ala Asn Tyr He Val Glu Gly Ser Lys Asp Gly 
625 630 635 640 

Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu 
645 650 655 

Asp He lie Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg 
660 665 670 

Leu Phe Asn Asn Asn Glu Leu He Val Ala Glu Val Leu Gin Ser Ala 
675 680 685 

Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys 
690 695 700 

Lys Ala Asp Asp His Gly Lys Gly Lys lie lie Leu Ala Thr Val Lys 
705 710 715 720 

Gly Asp Val His Asp lie Gly Lys Asn Leu Val Glu lie lie Leu Ser 
725 730 735 
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Asn Asn Gly Phe Arg He Val Asn Leu Gly He Lys Val Thr Ser Asn 
740 745 750 

Glu Leu He Glu Ala Val Ala Arg Glu Asn Pro Asp Ala He Gly Leu 
755 760 765 

Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Leu Thr Ala Gin 
770 775 780 

Asp Leu Lys Gin Gin Gin He Ser He Pro He Leu Val Gly Gly Ala 
785 790 795 800 

Ala Leu Thr Arg Lys Phe Thr Asn Thr Lys He Ala Pro Glu Tyr Asp 
805 810 815 

Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala 
820 825 830 

Asn Lye Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu 
835 840 845 

His Glu Ala Lys Glu Gin Ala Asn Ser Arg Thr Gin Met Gly Gly Gly 
850 855 860 

Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr 
865 870 875 880 

Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His He Leu Arg His 
885 890 895 

His Ser He Ala His Leu Glu Pro Tyr He Asn Met Gin Met Leu Leu 
900 905 910 

Gly Arg His Leu Gly Leu Gin Gly Lys Val Ser Arg Leu Leu Ala Glu 
915 920 925 

Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu 
930 935 940 

Thr Arg Val Lys Glu Glu Gin Leu Met Glu Ala His Gly Met Tyr Gin 
945 950 955 960 

Phe Phe Pro Ala Gin Ser Asp Gly Asp Asp He Val lie Tyr Asp Gin 
965 970 975 

Thr Gly Thr Asn Glu He Glu Arg Phe His Phe Pro Arg Gin Asn Lys 
980 985 990 

Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly 
995 1000 ~ 1005 

Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly He 
1010 1015 1020 

Arg Glu Leu Gly Glu Gin Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser 
1025 1030 1035 1040 

His Leu He Gin Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu 
1045 1050 1055 

Arg Val His Gin Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala 
1060 1065 1070 
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Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Gly lie Arg 
1075 1080 10B5 

Val' Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gin Ala Lys 
1090 1095 iioo 

Leu Phe Lys Leu Leu Lys Pro Gly Lys lie Gly lie Glu Leu Thr Glu 
1105 llio ins 1120 

Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala 
1125 1130 1135 

His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1140 1145 



.<210> 13 
<211> 3411 
<212> DNA 

<213> Bacillus stearothermophilus 

<220> 
<221> CDS 
<222> (1) . . (3408) 
<223> RBE02044 

<400> 13 

atg get aac gtc acc tta gaa cag caa ctg caa aga aaa att ctt gtc 48 

Met Ala Asn Val Thr Leu Glu Gin Gin Leu Gin Arg Lys lie Leu Val 

1 5 10 15 

ate gat ggc gec atg ggc acg atg ate caa age gee aac eta teg gec 96 
lie Asp Gly Ala Met Gly Thr Met lie Gin Ser Ala Asn Leu Ser Ala 
20 25 30 

gee gac ttt ggc ggc gag gcg tat gaa ggg tgc aac gaa tat ttg acc 144 
Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly CyB Asn Glu Tyr Leu Thr 
35 40 45 

etc acc gee ccg cat gtc ate cgc cgc att cat gaa gcg tac eta gaa 192 
Leu Thr Ala Pro His Val He Arg Arg He His Glu Ala Tyr Leu Glu 
50 55 60 

gee ggt get gat ate att gaa acg aac acg ttc gga gcg aca cgc ate 240 
Ala Gly Ala Asp He He Glu Thr Asn Thr Phe Gly Ala Thr Arg He 
65 70 75 80 

gtg ctt gac gaa tat ggc etc ggt cat ttg gcg ctt gag ctg aac ate 288 
Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn He 
85 90 95 

gaa gcg gec aaa etc gee aaa caa acg get gag teg ttc tec acc ccg 336 
Glu Ala Ala Lys Leu Ala Lys Gin Thr Ala Glu Ser Phe Ser Thr Pro 
!00 105 Ho 

gac tgg ccg cgc ttt gtc gee ggt teg atg ggg ccg acg acg aaa acg 384 
Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr 
115 120 125 

ttg teg gtg aca ggc ggc gca acg ttt gaa gaa etc gtc gee gee tac 432 
Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr 
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130 135 140 

gaa gaa caa gcg cgc gga ctg etc tta gga ggc gtc gac ctt etc eta 480 
Glu Glu Gin Ala Arg Gly Leu Leu Leu Gly Gly Val Asp Leu Leu Leu 
145 150 155 160 

etc gag acg tgc caa gat acg ctg aat gtc aaa gec ggt ttt etc ggc 528 
Leu Glu Thr Cys Gin Aep Thr Leu Asn Val Lys Ala Gly Phe Leu Gly 
165 170 175 

att teg aag gcg ttt gaa gcg gtc ggc cgc cgc gtg ccg etc atg att 576 
He Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met He 
180 185 190 

tec ggc acg ate gaa ccg atg ggc acg acg etc gee ggg cag gcg ate 624 
Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Ala He 
195 200 205 

gat gcg ttt ttc ate teg gtg cgc cat atg aag ccg ate gec gtc ggc 672 
Asp Ala Phe Phe He Ser Val Arg His Met Lys Pro He Ala Val Gly 
210 215 220 

tta aac tgc gca acc ggt ccg gag ttt atg ace gac cat ttg cgc acg 720 
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr 
225 230 235 240 

etc gee teg etc get gac acg gcg gtc age tgc tac ccg aac gec ggt 768 
Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cys Tyr Pro Asn Ala Gly 
245 250 255 

ctg ccg gat gag gaa ggc cac tat cat gaa acg ccg aat atg ctg gca 816 
Leu Pro Asp Glu Glu Gly His Tyr His Glu Thr Pro Asn Met Leu Ala 
260 265 270 

gag aaa ate cgc cgc ttt gec gaa aag gga tgg ate aac ate gtc ggc 864 
Glu Lys He Arg Arg Phe Ala Glu Lys Gly Trp He Asn He Val Gly 
275 280 285 

ggg tgt tgc ggc acg acg ccg gat cat ate cgc gec att get gaa gcg 912 
Gly Cys Cys Gly Thr Thr Pro Asp His He Arg Ala He Ala Glu Ala 
290 295 300 

gtg cgt gat etc ccg ccg egg gcg att ccg tct teg ttt gat gtc cac 960 
Val Arg Asp Leu Pro Pro Arg Ala He Pro Ser Ser Phe Asp Val His 
305 310 315 320 

gee gtt tec ggc ate gag gcg etc ate tat gat gaa acg atg cgc ccg 1008 
Ala Val Ser Gly He Glu Ala Leu He Tyr Asp Glu Thr Met Arg Pro 
325 330 335 

etc ttt gtc ggc gag egg aca aac gtg ate ggc teg cgc aaa ttc aag 1056 
Leu Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys 
340 345 350 

cgc etc ate gee gaa ggg aaa tac gaa gaa gcg gcg gaa ate gec cgc 1104 
Arg Leu He Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu He Ala Arg 
355 360 365 

gee caa gtg aaa aac ggc gec cat gtc ate gac att tgc etc gee gac 1152 
Ala Gin Val Lys Asn Gly Ala His Val He Asp He Cys Leu Ala Asp 
370 375 380 
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cca gac cgc gac gaa etc cat gac atg gag cag ttc gtc cgc gaa gtc 1200 
Pro Asp Arg Asp Glu Leu His Asp Met Glu Gin Phe Val Arg Glu Val 
385 390 395 400 

gtg aaa aaa gtg aaa gtg ccg ctt gtc ate gat teg acc gac gag cgc 1248 
Val Lys Lys Val Lys Val Pro Leu Val He Asp Ser Thr Asp Glu Arg 
405 410 415 

gtc ate gaa cgc gee ctt acg tat teg caa ggg aag gcg ate ate aac 1296 
Val He Glu Arg Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn 
420 425 430 

teg ate aac etc gaa gat ggc gaa gag egg ttt gcg aag gtc gtt cct 1344 
Ser He Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro 
435 440 445 

etc ctg cat caa tac ggc gee gee gtt gtc gtc ggc acg ate gat gag 1392 
Leu Leu His Gin Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu 
450 455 460 

caa gga atg gcg gtt aca gee gaa egg aaa ttg gaa ate gec ttg cgt 1440 
Gin Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu He Ala Leu Arg 
465 470 475 480 

teg tat gac ttg ctg gtg aac cgc tac ggc gtc ccc gag cgc gac ate 1488 
Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp He 
485 490 495 

att ttc gac ccg etc gtc ttc ccg gtc ggc acc ggc gat gag caa tac 1536 
He Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr 
500 505 510 

ate ggc gcg gcg aaa gaa acc att gag ggc ate cgc etc att aaa gag 1584 
He Gly Ala Ala Lys Glu Thr He Glu Gly He Arg Leu He Lys Glu 
515 520 525 

egg ctg cct cat tgc ttg acg atg ctt ggc ate age aac gtc teg ttc 1632 
Arg Leu Pro His Cys Leu Thr Met Leu Gly He Ser Asn Val Ser Phe 
530 535 540 

ggc ttg ccg ccg gee gga cgc gag gtg etc aac tec gtc ttt ttg tac 1680 
Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr 
545 550 555 560 

cat tgc acg caa gec ggg etc gat tac gee ate gtc aac acc gag aaa 1728 
His Cys Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys 
565 570 575 

ttg gag egg ttc gec teg att ccg gaa gag gaa gtg cga atg get gag 1776 
Leu Glu Arg Phe Ala Ser He Pro Glu Glu Glu Val Arg Met Ala Glu 
580 585 590 

gca ctt ctt ttt gac aca aac gac gaa aca tta aac gec ttt ate gaa 1824 
Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe He Glu 
595 600 605 

ttt tac cga age aaa ate acc gee gec aaa ccg gcg cag acg aac ttg 1872 
Phe Tyr Arg Ser Lys He Thr Ala Ala Lys Pro Ala Gin Thr Asn Leu 
610 615 620 

age ttg gaa gag egg etc gee cgc tac gtt att gaa ggg teg aaa gac 1920 
Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val He Glu Gly Ser Lys Asp 
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625 630 635 640 

ggg etc att etc gat ttg gaa aag gcg ctt gag acc tac tec gat ccg 1968 
Gly Leu He Leu Asp Leu Glu Lys Ala Leu Glu Thr Tyr Ser Asp Pro 
645 650 655 

ctg tec ate ate aac ggt ccg etc atg gec ggc atg gat gaa gtc ggg 2016 
Leu Ser He He Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly 
660 665 670 

egg ctg ttc aac aac aac cag etc ate gtc get gaa gta ttg caa age 2064 
Arg Leu Phe Asn Asn Asn Gin Leu He Val Ala Glu Val Leu Gin Ser 
675 680 685 

gcg gaa gtg atg aaa gca gcg gtc gee ttt tta gag ctg tat atg gaa 2112 
Ala Glu val Met Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu 
690 695 700 

aag aaa gaa gga age aca aaa gga aaa gtc att etc gee acc gtc aaa 2160 
Lys Lys Glu Gly Ser Thr Lys Gly Lys Val He Leu Ala Thr Val Lys 
705 710 715 720 

ggc gat gtg cat gac ate ggc aaa aac ttg gtc gac ate att tta age 2208 
Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser 
725 730 * 735 

aac aac ggc tac gag gtg ate gac etc ggc att aaa gtc get ccg cag 2256 
Asn Asn Gly Tyr Glu Val He Asp Leu Gly He Lys Val Ala Pro Gin 
740 745 750 

caa etc att gaa gcg gtg cgc gaa cat cag ccg gac ate ate ggg ttg 2304 
Gin Leu He Glu Ala Val Arg Glu His Gin Pro Asp He He Gly Leu 
755 760 765 

teg ggc ttg ctt gtg aaa teg get caa cag atg gtc gtc acc gee caa 2352 
Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Val Thr Ala Gin 
770 775 780 

gac ttg cgc caa gcg ggc ate teg acc ccg att tta gtc ggc ggc gee 2400 
Asp Leu Arg Gin Ala Gly He Ser Thr Pro He Leu Val Gly Gly Ala 
785 790 795 800 

gee ttg acg cgc aaa ttt acg gaa aac aaa ate gcg ccc gag tac gac 2448 
Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys He Ala Pro Glu Tyr Asp 
80S 810 815 

ggc gtt gtc ttg tac gcg aaa gac gee atg gac ggg etc gee ctt gee 2496 
Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala 
820 825 830 



aac caa ate cag cag ggc gag att gac tac aag aaa aaa gaa acg gee 
Asn Gin He Gin Gin Gly Glu He Asp Tyr Lys Lys Lys Glu Thr Ala 
835 840 * 845 



2544 



gaa age gag cca acg egg caa acg acg gtg gtc aca gcg gtc aaa teg 2592 
Glu Ser Glu Pro Thr Arg Gin Thr Thr Val Val Thr Ala Val Lys Ser 
850 855 860 



acc gtc teg acc gac gtt ccc gtc tac ate ccg gee gat etc gag cgc 
Thr Val Ser Thr Asp Val Pro Val Tyr He Pro Ala ABp Leu Glu Arg 
865 870 875 880 



2640 
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cac gcg ctg cga aat gtg ccg ctt gac cac att ttg ccg tac gtc aac 2688 

HiB Ala Leu Arg Ann Val Pro Leu Asp His He Leu Pro Tyr Val Asn 
885 890 895 

tgg caa atg gtg etc ggc cac cac etc ggc ttg aaa gga aaa' gtg aaa 2736 
Trp Gin Met Val Leu Gly His His Leu Gly Leu Lye Gly Lys Val Lys 
900 905 910 

egg ctg ctt gaa gag aaa gac gaa aaa gcg ttg gcg tta aaa gcg gtc 2784 
Arg Leu Leu Glu Glu LyB Asp Glu Lys Ala Leu Ala Leu Lys Ala Val 
915 920 925 "' 

gtc gac gaa ctg etc gec gaa gcg aaa gag cgc cgc tgg att cag ccc 2 832 
Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp He Gin Pro 
930 935 940 

gec ggc gtc tac cgc ttc ttc ccg gcg caa age gac ggc aac egg gtt 2880 
Ala Gly Val Tyr Arg Phe Phe Pro Ala Gin Ser Asp Gly Asn Arg Val 
945 950 955 960 

tac att tac gat ccg act gac ggc aaa aca gtg etc gag atg ttc gac 2928 
Tyr He Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp 
965 970 975 

ttt ccg cgc caa ccg egg gcg ccg tat ctt tgc etc gec gat tat ttg 2976 
Phe Pro Arg Gin Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu 
980 985 990 

aaa teg aaa gaa age ggc gaa atg gat tac gtc ggt ttg ttc gec gtc 3024 
Lys Ser Lys Glu Ser Gly Glu Met Asp Tyr Val Gly Leu Phe Ala Val 
995 1000 1005 

acc get ggg cat ggc gtc cgc gaa etc gee cag cgc tgg aag gaa gaa 3072 
Thr Ala Gly His Gly Val Arg Glu Leu Ala Gin Arg Trp Lys Glu Glu 
1010 1015 1020 

ggc gaa ttt ttg aaa age cat gee ate caa gcg ttg gcg etc gag att 3120 
Gly Glu Phe Leu Lys Ser His Ala He Gin Ala Leu Ala Leu Glu He 
1025 1030 1035 1040 

gee gaa ggg ttc gee gaa cga ate cat caa att atg cgc gac cgc tgg 3168 
Ala Glu Gly Phe Ala Glu Arg He His Gin He Met Arg Asp Arg Trp 
1045 1050 1055 

ggc ttc ccg gac gac ccg gat ttc acg atg gaa gag cgc ttc gee gec 3216 
Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg Phe Ala Ala 
1060 1065 1070 

aaa tac cag ggc cag cgc tac teg ttc ggc tac ccg gee tgt ccg aac 3264 
Lys Tyr Gin Gly Gin Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn 
1075 1080 1085 

ttg gaa gac cag gag aaa ctg ttc cgt ctg ctt cat cca gaa gac ate 3312 
Leu Glu Asp Gin Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp He 
1090 1095 iioo 

ggc ate cgt etc acc gac ggc tat atg atg gaa ccc gaa gca teg gtt 3360 
Gly He Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val 
U05 mo 1115 1120 

teg gcg ate gtc ttc gee cat ccg gaa gcg egg tat ttc aat gtg tta 34 08 
Ser Ala He Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
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1125 



1130 



1135 



taa 



3411 



<210> 14 
<211> 1136 
<212> PRT 

<213> Bacillus stearothermophilus 
<400> 14 

Met Ala Asn Val Thr Leu Glu Gin Gin Leu Gin Arg Lys lie Leu Val 
1 5 10 15 

He Asp Gly Ala Met Gly Thr Met He Gin Ser Ala Asn Leu Ser Ala 
20 25 30 

Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly Cys Asn Glu Tyr Leu Thr 
35 40 45 

Leu Thr Ala Pro His Val He Arg Arg He His Glu Ala Tyr Leu Glu 
50 55 60 

Ala Gly Ala Asp He He Glu Thr Asn Thr Phe Gly Ala Thr Arg He 
65 70 75 BO 

Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn He 
85 90 95 

Glu Ala Ala Lys Leu Ala Lys Gin Thr Ala Glu Ser Phe Ser Thr Pro 
100 105 HO 

Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr 
115 120 125 

Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr 
130 135 140 

Glu Glu Gin Ala Arg Gly Leu Leu Leu Gly Gly val Asp Leu Leu Leu 
145 150 155 160 

Leu Glu Thr Cys Gin Asp Thr Leu Asn Val Lys Ala Gly Phe Leu Gly 
165 170 175 

He Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met He 
180 185 " 190 

Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Ala He 
195 200 205 

Asp Ala Phe Phe He Ser Val Arg His Met Lys Pro He Ala Val Gly 
210 215 220 

Leu Asn Cya Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr 
225 230 235 240 

Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cys Tyr Pro Asn Ala Gly 
245 250 255 

Leu Pro Asp Glu Glu Gly HiB Tyr His Glu Thr Pro Asn Met Leu Ala 



260 



265 



270 
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Glu Lye lie Arg Arg Phe Ala Glu Lys Qly Trp He Asn lie Val Gly 
275 " 280 285 

Gly Cys Cys Gly Thr Thr Pro Asp His lie Arg Ala He Ala Glu Ala 
290 295 300 

Val Arg Asp Leu Pro Pro Arg Ala He Pro Ser Ser Phe Asp Val His 
305 310 315 320 

Ala Val Ser Gly He Glu Ala Leu He Tyr Asp Glu Thr Met Arg Pro 
325 330 335 

Leu Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys 
340 345 350 

Arg Leu He Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu He Ala Arg 
355 360 365 

Ala Gin Val Lys Asn Gly Ala His Val He Asp He Cys Leu Ala Asp 
370 375 380 

Pro Asp Arg Asp Glu Leu His Asp Net Glu Gin Phe Val Arg Glu Val 
385 390 395 400 

Val Lys Lys Val Lys Val Pro Leu Val lie Asp Ser Thr Asp Glu Arg 
405 410 415 

Val He Glu Arg Ala Leu Thr Tyr Ser Gin Gly Lys Ala lie lie Asn 
420 425 430 

Ser lie Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro 
435 440 445 

Leu Leu His Gin Tyr Gly Ala Ala Val Val Val Gly Thr lie Asp Glu 
450 455 460 

Gin Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu He Ala Leu Arg 
465 470 475 480 

Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp lie 
485 490 495 

lie Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr 
500 505 510 

He Gly Ala Ala Lys Glu Thr lie Glu Gly lie Arg Leu lie Lys Glu 
515 520 525 

Arg Leu Pro HiB Cys Leu Thr Met Leu Gly lie Ser Asn Val Ser Phe 
530 535 540 

Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr 
545 550 555 560 

His Cye Thr Gin Ala Gly Leu Asp Tyr Ala lie Val Asn Thr Glu Lys 
565 570 575 

Leu Glu Arg Phe Ala Ser lie Pro Glu Glu Glu Val Arg Met Ala Glu 
580 585 590 

* Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe lie Glu 
.595 600 605 
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Phe Tyr Arg Ser Lys He Thr Ala Ala Lys Pro Ala Gin Thr Asn Leu 
610 615 620 

Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val He Glu Gly Ser Lys Asp 
625 630 635 640 

Gly Leu He Leu Asp Leu Glu Lys Ala Leu Glu Thr Tyr Ser Asp Pro 
645 650 655 

Leu Ser He He Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly 
660 665 670 

Arg Leu Phe Asn Asn Asn Gin Leu He Val Ala Glu Val Leu Gin Ser 
675 6B0 685 

Ala Glu Val Met Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu 
690 695 700 

Lys Lys Glu Gly Ser Thr Lys Gly Lys Val He Leu Ala Thr Val Lys 
705 710 715 720 

Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp He lie Leu Ser 
725 730 735 

Asn Asn Gly Tyr Glu Val He Asp Leu Gly He Lys Val Ala Pro Gin 
740 745 750 

Gin Leu He Glu Ala Val Arg Glu His Gin Pro Asp He He Gly Leu 
755 760 765 

Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Val Thr Ala Gin 
770 775 780 

Asp Leu Arg Gin Ala Gly He Ser Thr Pro He Leu Val Gly Gly Ala 
785 790 795 800 

Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys He Ala Pro Glu Tyr Asp 
805 810 815 

Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala 
820 825 830 

Asn Gin He Gin Gin Gly Glu He Asp Tyr Lys Lys Lys Glu Thr Ala 
835 840 845 

Glu Ser Glu Pro Thr Arg Gin Thr Thr Val Val Thr Ala Val Lys Ser 
850 855 860 

Thr Val Ser Thr Asp Val Pro Val Tyr He Pro Ala Asp Leu Glu Arg 
865 870 875 680 

His Ala Leu Arg Asn Val Pro Leu Asp His He Leu Pro Tyr Val Asn 
885 890 895 

Trp Gin Met Val Leu Gly His His Leu Gly Leu Lys Gly Lys Val Lys 
900 905 910 

Arg Leu Leu Glu Glu Lys Asp Glu Lys Ala Leu Ala Leu Lys Ala Val 
915 920 925 

Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp He Gin Pro 
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930 935 940 

Ala Gly Val Tyr Arg Phe Phe Pro Ala Gin Ser Asp Gly Asn Arg Val 
9 « 950 955 4 960 

Tyr He Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp 
965 970 975 

Phe Pro Arg Gin Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu 
980 985 99p 

Lys Ser Lys Glu Ser Gly Glu Met Asp Tyr Val Gly Leu Phe Ala Val 
995 1000 1005 

Thr Ala Gly His Gly Val Arg Glu Leu Ala Gin Arg Trp Lys Glu Glu 
1010 1015 1020 

Gly Glu Phe Leu Lys Ser His Ala He Gin Ala Leu Ala Leu Glu He 
1Q 25 1030 1035 1040 

Ala Glu Gly Phe Ala Glu Arg He His Gin He Met Arg Asp Arg Trp 
1045 1050 1055 

Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg Phe Ala Ala 
1060 1065 1070 

Lys Tyr Gin Gly Gin Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn 
1075 1080 1085 

Leu Glu Asp Gin Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp lie 
1090 1095 1100 

Gly He Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val 
1105 1110 ins H20 

Ser Ala He Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1125 H30 H35 



<210> 15 
<211> 3681 
<212> DNA 

<213> Vibrio cholerae 

<220> 
<221> CDS 
<222> (1) . . (3678) 
<223> RVC04265 

<400> 15 

gtg gga aaa gaa gta aga caa caa 

Val Gly Lys Glu Val Arg Gin Gin 
1 5 

ate eta ctg att gat ggt ggt atg 
He Leu Leu He Asp Gly Gly Met 
20 



etc gaa cag caa ttg aaa caa cgt 48 
Leu Glu Gin Gin Leu Lys Gin Arg 
10 15 

ggt acc atg att cag agt tat aag 96 
Gly Thr Met He Gin Ser Tyr Lys 
25 30 



tta caa gag gaa gac tat cgc ggt gca cga ttt gtc gat tgg cac tgt 144 
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Leu Gin Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys 
35 40 45 

gat ttg aaa gga aat aac gac etc tta gtg ctt act cag ccg caa att 192 
Asp Leu LyB Gly Asn Asn Asp Leu Leu Val Leu Thr Gin Pro Gin He 
50 55 60 



att aaa gag att cac tec get tac ctt gaa gcg ggg gcg gat att ctt 
He Lys Glu He His Ser Ala Tyr Leu Glu Ala Gly Ala Asp He Leu 
65 70 75 



80 



tea gga cag aca acg gaa get ttc tac aac gec ttg cgt cat gta egg 
Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg 
225 230 235 



240 



cac cca aat gec gga ctg ccc aat gcg ttt ggt gaa tac gat etc tct 
His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser 
275 280 285 



240 



gag acc aac ace ttt aac tea acc acg att gec atg gca gac tat gac 288 
Glu Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Asp 
85 90 95 

atg caa teg etc agt get gaa att aac ttt gec gcg get aag ctt gca 336 
Met Gin Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala 
100 105 no 

cgt gaa gtc gcg gat gag tgg acg get aaa gat cca agt egg cca cgc 384 
Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg 
115 120 * 125 

tat gtg get ggt gtg ctt ggg cca acc aac cgt act tgc tct att teg 432 
Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He Ser 
130 135 140 

cca gat gtg aac gat cca gga ttt cgt aac gtc act ttt gat ggg ctt 480 
Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu 
150 155 160 

gtt gaa gec tat tec gaa teg acg cgc get ttg ate aaa ggt ggc age 528 
Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Lys Gly Gly Ser 
165 170 175 

gat ctg ate etc att gaa acc ate ttc gat aca ctt aac gee aaa gee 576 
Asp Leu He Leu lie Glu Thr lie Phe Asp Thr Leu Asn Ala Lys Ala 
180 185 190 

tgt gcg ttt gcg gtc gat age gta ttt gaa gag ctg ggc ate age tta 624 
Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly He Ser Leu 
195 200 205 

cct gtg atg att tec ggc acg att acc gat gec tct ggg cga act ctg 672 
Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu 
210 215 220 



720 



ccg att teg ttt ggc ttg aac tgt gcg tta ggt cct gat gag ctg cgc 768 
Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg 
245 250 255 

cag tac gtg gaa gag ctt tea cgc att tea gaa tgc tat gtt tec gcg 816 
Gin Tyr Val Glu Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser Ala 
260 265 ' 270 



B64 
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gcc gag gaa atg gca gaa cat att gcg gaa tgg gca caa get ggc ttt 912 
Ala Glu Glu Met Ala Glu His He Ala Glu Trp Ala Gin Ala Gly Phe 
290 295 300 

ttg aat ttg gtc ggt ggt tgc tgt gga act aca cct gag cat ate gcc 960 
Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala 
305 310 315 320 

gcc att gcc aaa gcc gtc gag ggt gta aaa cca agg get ctg cca gat 1008 
Ala He Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala LeAi Pro Asp 
325 330 335 

ctg aaa gta gaa tgt cgt etc teg ggt tta gag ccg etc aat att ggt 1056 
Leu Lys Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly 
340 345 350 

cct gaa ace ttg ttt gtt aac gtg ggc gaa cgt act aac gtc acc ggt 1104 
Pro Glu Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly 
355 360 365 

tct gcg cgt ttt aag cgt tta att aaa gaa gag caa tac gac gaa gcg 1152 
Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Gin Tyr Asp Glu Ala 
370 375 380 

etc gat gtg gcg cgt gag caa gtc gaa aac ggc gcg cag ate att gat 1200 
Leu Asp Val Ala Arg Glu Gin Val Glu Asn Gly Ala Gin He He Asp 
385 390 395 400 

ate aac atg gat gaa ggc atg ttg gac gcc gag gcg tgt atg gtg cgc 1248 
He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg 
405 410 415 

ttt ttg aat eta tgc gcc tct gaa cca gaa ata tec aaa gtt ccg gtg 1296 
Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro Val 
420 425 430 

atg gtc gac tec tct aaa tgg gaa gtc att gaa gcg ggt ctg aaa tgc 1344 
Met Val Asp Ser Ser Lys Trp Glu Val lie Glu Ala Gly Leu Lys Cys 
435 440 445 

att cag ggt aaa ggc ate gtc aac tct ate tct eta aaa gaa ggg aaa 1392 
He Gin Gly Lys Gly lie Val Asn Ser lie Ser Leu Lys Glu Gly Lys 
450 455 460 

gag aag ttt att gcc caa gcc aaa ttg gtg cgc cgc tac ggt gcc gcg 1440 
Glu Lys Phe He Ala Gin Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala 
465 470 475 480 

gtg att gtg atg gca ttt gac gaa gtg ggc caa gcc gat acc cgt gag 14 88 
Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg Glu 
485 490 495 

cgc aaa tta gag ate tgt cgt egg get tac cat att ttg gtc gat gag 1536 
Arg Lys Leu Glu lie Cys Arg Arg Ala Tyr His He Leu Val Asp Glu 
5 °0 505 510 

gtg ggc ttc cca ccg gaa gat att att ttt gac ccg aac ate ttt get 1584 
Val Gly Phe Pro Pro Glu Asp He lie Phe Asp Pro Asn lie Phe Ala 
515 520 525 

gtt gcg acc gga att gat gag cac aat aac tac gca ctg gat ttc att 1632 



I 
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Val Ala Thr Gly lie Asp Glu His Asn Asa Tyr Ala Leu Asp Phe lie 
530 535 540 

aat gca gtg gcg gac att aag cgt gag ctg ccg cat gcg atg att tct 1680 
Asn Ala Val Ala Asp lie Lys Arg Glu Leu Pro His Ala Net lie Ser 
545 550 555 560 

ggc ggt gtt tct aac gtt tec ttc tct ttc cgc ggc aac aac tat gtg 1728 
Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val 
565 570 575 

cgt gaa gcg ate cat get gtt ttc ctt tat cac tgc ttc aaa cac ggc 1776 
Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys phe Lys His Gly 
580 585 590 

atg gac atg ggg att gtc aac gca ggg cag ctt gaa ate tac gat aac 1824 
Met Asp Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp Asn 
595 600 605 

gtt ccg ctg aaa ctg cgt gag gca gtg gaa gat gtg ate etc aat cga 1872 
Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val He Leu Asn Arg 
610 615 620 

cgt age gat ggc acg gaa aga ctg ctt gag ate gee gaa gcg tat cgc 1920 
Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu He Ala Glu Ala Tyr Arg 
625 630 635 640 

gaa aac agt gtt ggt aaa gaa gag gat get tct gca tta gag tgg cgc 1968 
Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg 
645 650 655 

gca tgg cct gtg get aag cgc eta gag cac get tta gtc aaa ggc ate 2016 
Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

acc gaa ttt ate gtc caa gac act gaa gaa gca cgt cag caa gee agt 2064 
Thr Glu Phe He Val Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ser 
675 680 685 

aaa cca ctg gaa gtg att gaa ggg ccg ctg atg gat ggt atg aac gtg 2112 
Lys Pro Leu Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val 
690 695 700 

gtc ggt gac ttg ttc ggg gaa ggg aaa atg ttc eta ccg caa gtc gta 2160 
Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val 
705 710 715 720 

aaa tea gcg cgt gtc atg aaa caa gee gtt gcg tat ctt gag cct ttc 2208 
Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe 
725 730 * 735 

att aat gcg caa aaa agt ggt age act tea aat ggt aag att ttg ctg 2256 
He Asn Ala Gin Lys Ser Gly Ser Thr Ser ABn Gly Lys He Leu Leu 
740 745 750 

gcg acc gta aaa ggc gat gtg cat gac att ggt aag aac att gtt ggc 2304 
Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly 
755 760 765 

gtc gtg ctg cag tgt aat aac ttc gag ate ate gat ctt ggt gtg atg 2352 
Val Val Leu Gin Cys Asn Asn Phe Glu He He Asp Leu Gly Val Met 
770 775 780 
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gtg cct tgc gag cag ate etc aaa gtc gca cgc gag caa aat gtc gat 2400 
Val Pro Cys Glu Gin He Leu Lys Val Ala Arg Glu Gin Asn Val Asp 
785 790 795 , 800 

ate ate ggt etc tct ggg ctt ate acg ccg tct ttg gat gag atg gta 2448 
He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val 
805 810 815 

cac gtg gcg aaa gag atg gag cga caa. ggg ttt gaa ctg cca ctt ttg 2496 
His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Glu Leu Pro Leu Leu 
820 825 830 

att ggt ggg gca aca acg tct aaa gcg cat act gcg gtg aag att gaa 2544 
He Gly Gly Ala Thr Thr Ser Lya Ala His Thr Ala Val Lys He Glu 
835 840 845 

cag aat tat cat gcg cct gta gtg tac gtg aat aac gcg teg cgc gcg 2592 
Gin Asn Tyr His Ala Pro Val Val Tyr Val Asn ABn Ala Ser Arg Ala 
850 855 860 

gta ggg gtg tgc aca tea tta ttg tct gat gaa cag cgc ccc gga ttt 2640 
Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gin Arg Pro Gly Phe 
865 870 875 880 

ate gaa cgt ttg gat etc gat tat gag cgc acg cgt gat cag cat get 2688 
He Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gin His Ala 
885 890 , 895 

cgt aaa acg ccc aaa teg cgc cca gtc acg tta gag cag gca cgt get 2736 
Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gin Ala Arg Ala 
900 905 910 

aat aaa gcg gcg ctg gat tgg gca aat tac acg ccg ccc get cct gcg 2784 
Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala 
915 920 925 

aaa ccg ggt gtg cat gtg ttt gaa aac att gcg tta gee aca eta cgt 2832 
Lys Pro Gly Val His Val Phe Glu Asn He Ala Leu Ala Thr Leu Arg 
930 935 940 

cct tat ate gat tgg acg cct ttt ttt atg act tgg teg ctt atg ggc 2880 
Pro Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly 
945 950 955 960 

aaa tac cct gec att ttg gag cat gaa gag gtc ggt gaa gag gee aaa 2928 
Lys Tyr Pro Ala lie Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys 
965 970 975 

cgt ctg ttt cat gat gee aat gee tta ctt gat aaa gta gag cga gaa 2976 
Arg Leu Phe His Asp Ala Asn Ala Leu Leu Asp Lys Val Glu Arg Glu 
980 985 * 990 

gga eta ctg aaa gee agt ggt atg tgt gca ctg ttt cca gca gec age 3024 
Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser 
995 1000 1005 

gtg ggc gat gac att gag gtg tac agt gat gaa teg cgt acg caa gtc 3072 
Val Gly Asp Asp He Glu Val Tyr Ser Asp Glu Ser Arg Thr Gin Val 
1010 ipi5 1020 

gcg cat gtg ctg tac aac ttg cgt cag cag act gag aaa ccg aaa ggg 3120 
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Ala His Val Leu Tyr Asn Leu Arg Gin Gin Thr Glu Lys Pro Lys Gly 
1025 1030 1035 1040 



gcc aac tac tgt ttg teg gac tat gtt get ccg aaa gag age ggt aaa 
Ala Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys 
1045 1050 1055 



3168 



cgt gat tgg att ggc gcg ttt gca gta act ggt ggc att ggt gag cga 
Arg Asp Trp lie Gly Ala Phe Ala Val Thr Gly Gly He Gly Glu Arg 
1060 1065 1070 



3216 



gcc ttg gcc gat get tat aaa get cag ggt gat gat tac aat gcg ate 
Ala Leu Ala Asp Ala Tyr Lys Ala Gin Gly Asp Asp Tyr Asn Ala He 
1075 1080 1085 



3264 



atg ate caa gcg gta gcc gat cgt ttg gcg gaa gcc ttt gcg gaa tat 
Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr 
1090 1095 1100 



3312 



ctg cat gaa aaa gtg cgt aaa gag att tgg ggt tat gcg age gat gaa 
Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ala Ser Asp Glu 
1105 1110 1115 1120 



3360 



aat etc tec aat gat gac ctg ate cgt gag cgt tat cag ggc att cga 
Asn Leu Ser Asn Asp Asp Leu He Arg Glu Arg Tyr Gin Gly He Arg 
1125 1130 1135 



3408 



ccc gcg ccg ggg tat ccc gcg tgt cct gag cat acc gag aaa gcg act 
Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr 
1140 1145 1150 



3456 



ttg tgg cag atg eta aat gtc gaa gag acc ata ggt atg tea ctg acc 
Leu Trp Gin Met Leu Asn Val Glu Glu Thr He Gly Met Ser Leu Thr 
1155 1160 1165 



3504 



aca age tat gcg atg tgg ccg ggc get teg gta tec ggt tgg tat ttc 
Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe 
1170 1175 HBO 



3552 



teg cat ccc gat tct cgc tat ttt gcg gta gcg cag ate caa cca gat 
Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gin He Gin Pro Asp 
1185 1190 1195 1200 



3600 



caa ctg cac age tac get gag cgt aaa ggt tgg cgt ttg gaa gaa get 
Gin Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala 
1205 1210 * 1215 



3648 



gaa aag tgg eta gcg cct aac ctt gat get taa 
Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala 
1220 1225 



3681 



<210> 16 
<211> 1226 
<212> PRT 

<213> Vibrio cholerae 
<400> 16 

Val Gly Lys Glu Val Arg Gin Gin Leu Glu Gin Gin Leu Lys Gin Arg 
15 10 15 



He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Lys 



WO 03/087386 PC17EP03/04010 

66 

20 25 30 

Leu Gin Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys 
35 40 45 

Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Gin Pro Gin He 
50 55 60 

He Lys Glu He His Ser Ala Tyr Leu Glu Ala Gly Ala Asp He Leu 
65 7 ° 75 80 

Glu Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Asp 
85 90 95 

Met Gin Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala 
100 105 no 

Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg 
115 120 125 

Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He Ser 
130 135 140 

Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu 
145 150 155 160 

Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He LyB Gly fcly Ser 
165 170 175 

Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala 
180 185 190 

Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly He Ser Leu 
195 200 205 

Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu 
210 215 220 

Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg 
225 230 235 240 

Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg 
245 250 255 

Gin Tyr Val Glu Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser Ala 
260 265 * 270 

His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser 
275 280 285 

Ala Glu Glu Met Ala Glu His He Ala Glu Trp Ala Gin Ala Gly Phe 
290 295 300 

Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu HiB He Ala 
305 310 315 320 

Ala He Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala Leu Pro Asp 
325 330 335 

Leu Lys Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly 
340 345 350 
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Pro Glu Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly 
355 360 365 

Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Gin Tyr Asp Glu Ala 
370 375 380 

Leu Asp Val Ala Arg Glu Gin Val Glu Asn Gly Ala Gin He He Asp 
385 390 395 400 

He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg 
405 410 * 415 

Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro Val 
420 425 430 

Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cys 
435 440 445 

He Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lys Glu Gly Lys 
450 455 460 

Glu Lys Phe He Ala Gin Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala 
465 470 475 480 

Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg Glu 
485 490 495 

Arg Lys Leu Glu He Cys Arg Arg Ala Tyr His He Leu Val Asp Glu 
500 505 510 

val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala 
515 520 525 

Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He 
530 535 540 

Asn Ala Val Ala Asp He Lys Arg Glu Leu Pro His Ala Met He Ser 
545 550 555 560 

Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val 
565 570 575 

Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys His Gly 
580 585 590 

Met Asp Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp Asn 
595 600 605 

Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val He Leu Asn Arg 
610 615 620 

Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu He Ala Glu Ala Tyr Arg 
625 630 635 640 

Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg 
645 650 655 

Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

Thr Glu Phe He Val Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ser 
675 680 685 
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Lya Pro Leu Glu Val lie Glu Gly Pro Leu Met Aep Gly Met Asn Val 
690 695 700 

Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val 
705 710 715 720 

Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe 
725 730 735 

He Asn Ala Gin Lys Ser Gly Ser Thr Ser Asn Gly Lys He Leu Leu 
740 745 750 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly 
755 760 765 

Val Val Leu Gin Cys Asn Asn Phe Glu He lie Asp Leu Gly Val Met 
770 775 780 

Val Pro Cys Glu Gin He Leu Lys Val Ala Arg Glu Gin Asn Val Asp 
785 790 795 800 

He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val 
805 810 815 

His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Glu Leu Pro Leu Leu 
820 825 830 

He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys lie Glu 
835 840 845 

Gin Asn Tyr HiB Ala Pro Val Val Tyr Val Asn Asn Ala Ser Arg Ala 
850 855 860 

Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gin Arg Pro Gly Phe 
865 870 875 880 

He Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gin His Ala 
885 890 895 

Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gin Ala Arg Ala 
900 905 910 

Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala 
915 920 925 

Lys Pro Gly Val His Val Phe Glu Asn He Ala Leu Ala Thr Leu Arg 
930 935 940 

Pro Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly 
545 950 955 960 

Lys Tyr Pro Ala lie Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys 
965 970 975 

Arg Leu Phe His Asp Ala Asn Ala Leu Leu Asp Lys Val Glu Arg Glu 
980 985 990 

Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser 
995 1000 1005 

Val Gly Asp Asp He Glu Val Tyr Ser Asp Glu Ser Arg Thr Gin Val 
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1010 1015 1020 

Ala His Val Leu Tyr Asn Leu Arg Gin Gin Thr Glu Lys Pro Lys Gly 
1025 1030 1035 1040 

Ala Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys 
1045 1050 1055 

Arg Asp Trp lie Gly Ala Phe Ala Val Thr Gly Gly lie Gly Glu Arg 
1060 1065 * 1070 

Ala Leu Ala Asp Ala Tyr Lys Ala Gin Gly Asp Asp Tyr Asn Ala He 
1075 1080 1085 

Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr 
1090 1095 1100 

Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ala Ser Asp Glu 
H05 1110 ins 1120 

Asn Leu Ser Asn Asp Asp Leu He Arg Glu Arg Tyr Gin Gly He Arg 
1125 H30 1135 

Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr 
1140 1145 H50 

Leu Trp Gin Met Leu Asn Val Glu Glu Thr He Gly Met Ser Leu Thr 
1155 1160 1165 

Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe 
1170 1175 1180 

Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gin He Gin Pro Asp 
H85 1190 H95 1200 

Gin Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala 
1205 1210 "* 1215 

Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala 
1220 1225 



<210> 17 
<211> 3822 
<212> DNA 

<213> Sinorhizobium meliloti 

<220> 
<221> CDS 
<222> (1) . . (3819) 
<223> RSM07338 

<400> 17 

gtg agt aaa teg ata att ctt tgt cgt ttt cag aac ggg aga tct ccc 48 

Val Ser Lys Ser He He Leu Cys Arg Phe Gin Asn Gly Arg Ser Pro 
15 10 15 

atg tec gee gec gac gec etc ttt gga aac gtc teg ccc aag ccg gat 96 
Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp 
20 25 30 



ggt teg gaa gtc ttt egg cag etc gec cag gcg gcg get gaa cgc ate 144 
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Gly Ser Glu Val Phe Arg Gin Leu Ala Gin Ala Ala Ala Glu Arg lie 
35 40 45 

etc ate atg gat ggc gee atg gga acg gag ate cag cag etc ggt ttc 192 
Leu lie Met Asp Gly Ala Met Gly Thr Glu lie Gin Gin Leu Gly Phe 
50 55 60 

gtg gag gat cac ttc cgc ggc gag cgc ttc ggt ggc tgc gee tgc cat 240 
Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His 
65 70 75 80 

cag cag ggc aac aac gac etc ctg acg etc act cag ccg aag gcg ate 288 
Gin Gin Gly Asn Asn Asp Leu Leu Thr Leu Thr Gin Pro Lys Ala lie 
85 90 95 

gag gat att cat tac cac tac gee ate gee ggc gec gat ate etc gaa 336 
Glu Asp He His Tyr His Tyr Ala He Ala Gly Ala Asp He Leu Glu 
100 105 no 

ace aac ace ttc tec teg acg egg ate gee cag gee gat tac ggc atg 384 
Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Gly Met 
115 120 125 

gag gac atg gtc tac gat etc aat cgc gac ggc gcg egg ctg gcg egg 432 
Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg 
130 135 140 

cga gee gcg aag egg gee gag gcg gag gat ggc egg egg cgc ttc gtg 480 
Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val 
145 150 155 160 

gca ggc gcg etc ggc ccc acc aac cgc ace get teg att teg ccg gac 528 
Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
165 170 175 

gtc aac aac ccc ggc tat cga gee gtc age ttc gac gat ctg agg etc 576 
Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Abp Asp Leu Arg Leu 
180 IBS 190 

gee tat gee gag cag gtg egg ggc etc ate gac ggc ggt gee gac ate 624 
Ala Tyr Ala Glu Gin Val Arg Gly Leu He Asp Gly Gly Ala Asp He 
195 200 205 

ate ctg ate gag acg ate ttc gac acg ctg aat gee aag gcg gcg ate 672 
He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
210 215 220 

ttc gcg acg cag gaa gtc ttt gee gaa aag ggc gtc cgc ctt ccg gtg 720 
Phe Ala Thr Gin Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val 
225 230 235 ~ 240 

atg ate tec gga acg ate acc gat etc tec ggc cgt acc etc tec ggc 768 
Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
245 250 255 

cag acg cct acg gee ttc tgg tat teg gtg cgc cat gcg gat ccg ttt 816 
Gin Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe 
260 26S 270 

acg ate ggg etc aac tgc gcg etc ggc gca aat gcg atg cgc gee cat 864 
Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His 
275 280 285 
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ata gac gag ctt teg gcg gtc gec gac acg etc gtc tgc gee tat ccg 912 
lie Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro 
290 295 300 

aat gec ggc ctg ccg aac gag ttc ggc cgc tat gac gaa age ccc gag 960 
Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu 
305 310 315 320 

cag atg gcg gcg cag gtc gag ggc ttc gec egg gac ggt etc gtc aac 1008 
Gin Met Ala Ala Gin Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn 
325 330 335 

ate gtc ggc ggc tgc tgc ggt tec acg ccg gec cat ate cgc gee att 1056 
He Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His He Arg Ala He 
340 345 350 

gee gaa gcg gtt gec aaa tat ccg ccg cgc egg gtg ccc gag ate gat 1104 
Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu He Asp 
355 360 365 

cgc cgc atg egg ctt tec ggc etc gaa ccc ttc acg ctt ace gac gag 1152 
Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu 
370 375 380 

att ccc ttc gtc aac gtc ggc gaa cgc acc aac gtc ace ggc teg gcg 1200 
He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
385 390 395 400 

aag ttc cgc aag ctg ate acc gee ggg gac tac gee gec gca etc gat 1248 
Lys Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp 
405 410 415 

gtg gcg cgt gat cag gtg gcg aat ggc gec cag ate ate gac gtc aac 1296 
Val Ala Arg Asp Gin Val Ala Asn Gly Ala Gin He He Asp Val Asn 
420 425 430 

atg gac gaa ggc ctg ate gat teg aag cag gtg atg gtc gag ttc ctg 1344 
Met Asp Glu Gly Leu He Asp Ser Lys Gin Val Met Val Glu Phe Leu 
435 440 445 

aac etc gtc gec tec gag ccg gat ate gec cgt gta ccg gtg atg ate 1392 
Asn Leu Val Ala Ser Glu Pro Asp He Ala Arg Val Pro Val Met He 
450 455 460 

gat teg teg aaa tgg gag gtg ate gaa gec ggg etc aaa tgc gtc cag 1440 
Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cys Val Gin 
465 470 475 " 480 

ggc aag gcg ctg gtg aac tec ate teg etc aag gaa ggc gag gcg get 1488 
Gly Lys Ala Leu Val Asn Ser He Ser Leu Lys Glu Gly Glu Ala Ala 
485 490 ' 495 

ttc ctg cac cat gcg cgc etc gtg cgc gec tat ggc gee gcg gtc gtg 1536 
Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val 
500 505 * 510 

gtg atg gcg ttc gac gag aag ggc cag gec gac acg aaa acc cgc aag 1584 
Val Met Ala Phe Asp Glu Lys Gly Gin Ala Asp Thr Lys Thr Arg Lys 
515 520 525 

gtg gaa ate tgc egg egg gec tat egg ctg ctg acg gaa gag gtt ggc 1632 
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Val Glu lie Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly 
530 535 540 

ttc ccc ccg gag gac ate ate ttc gac ccg aat ate ttc gcg gtc gcg 1680 
Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala 
545 550 555 560 

ace ggc ate gag gag cac aac aat tac ggc gtc gac ttc ate gag gcg 1728 
Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
565 570 575 

acg cac gag ate ate gcg gca ctg ccg cat gtc cac gtc tec ggc ggc 1776 
Thr His Glu He He Ala Ala Leu Pro His Val His Val Ser Gly Gly 
580 585 590 

gtg teg aac etc tec ttt tec ttc cgc ggc aac gag ccg gtg cgc gag 1824 
Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
595 600 " 605 

gcg atg cac gee ate ttc ctt tat cac gcg ate cag gee ggc atg gac 1872 
Ala Met His Ala He Phe Leu Tyr His Ala He Gin Ala Gly Met Asp 
610 615 620 

atg ggc ate gtc aat gec gga cag etc gee gtc tat gat gcg ate gac 1920 
Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Ala He Asp 
625 630 635 640 

ccg gaa ctg cgc gaa ace tgc gag gac gtg gtg etc aac cgc egg gee 1968 
Pro Glu Leu Arg Glu Thr Cys Glu Asp Val Val Leu Asn Arg Arg Ala 
645 650 655 

gat teg ace gag cgc etc ctg gag ate gee gag cgc tat cgc ggg aag 2016 
Asp Ser Thr Glu Arg Leu Leu Glu He Ala Glu Arg Tyr Arg Gly Lys 
660 665 670 

99C ggg age cag ggc aag gag aag gac ctt gee tgg cgc gaa tgg ccg 2064 
Gly Gly Ser Gin Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro 
675 680 685 

gtg gag aag egg etc gaa cac gcg etc gtc aat gga att ace gaa ttt 2112 
Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe 
690 695 700 

ate gaa gee gat acg gaa gag gee egg ctt gee gee gag egg ccg ctg 2160 
He Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu 
705 710 715 720 

cat gtc ate gaa ggc ccg ctg atg gec ggg atg aac gtc gtg ggc gat 2208 
His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
725 730 735 

etc ttc ggt tec ggc aag atg ttc ctg ccg cag gtg gtc aag tec gee 2256 
Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
740 745 750 

egg gtg atg aag cag gee gtt gcg gtg ctg etc ccc cat atg gag gag 2304 
Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro His Met Glu Glu 
755 760 765 

gag aag cgc gee aat ggc ggc ggc gag gcg cgc gag agt gee ggc aag 2352 
Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys 
770 775 780 
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ate ctg atg gcg acc gtc aag ggc gac gtg cac gac ate ggc aag aac 2400 
lie Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn 
785 790 795 t 800 

ate gtc ggc gtc gtg etc gee tgc aac aat tac gag ate ate gac etc 2448 
He Val Gly Val Val Leu Ala Cya Asn Asn Tyr Glu He He Asp Leu 
805 810 815 

ggc gtc atg gtg ccc teg get aag ate etc gaa gtg gcg cgc gaa cag 2496 
Gly Val Met Val Pro Ser Ala Lys He Leu Glu Val Ala Af£ Glu Gin 
820 825 B30 

aag gtc gac ate gtc ggt ctt tec ggc etc ate acg ccg teg ctg gac 2544 
Lys Val Asp He Val Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp 
835 840 845 

gag atg gcg cat gtc get tec gag etc gaa egg gag ggc ttc gat gtc 2592 
Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val 
850 855 860 



ccg ctg ctg ate ggc ggg gcg acg acc age cgc gtg cac acg gee gtg 
Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val 
865 870 875 



880 



aag ate aat ccg cgt tac age etc ggc cag acg gtc tat gtc acc gac 
Lys He Asn Pro Arg Tyr Ser Leu Gly Gin Thr Val Tyr Val Thr Asp 
885 890 895 



2640 



2688 



gec age cgc gcg gtc ggc gtc gta teg age ctg etc teg ccg gaa gtc 
Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val 
500 905 9io 



2736 



cgc gac tec tac aag aaa acg gtc cgc gcg gag tat ctg aag gtt gee 2784 
Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala 
915 920 925 

gac gca cat gee cgc aac gaa gee gag aag cgc cgt ctg ccg ctt tec 2832 
Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser 
930 935 940 



cag gcg egg gcg aat gee ttt egg ata gat tgg gac gee cac cag ccg 
Gin Ala Arg Ala Asn Ala Phe Arg He Asp Trp Asp Ala His Gin Pro 
945 950 955 960 



2880 



aag gtt ccg tec ttc etc ggc acg cgt gtt ttc gag gga tgg gac etc 
Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu 
965 970 975 

gec gaa etc gee cgc tat ate gac tgg acg ccg ttc ttc cag acc tgg 
Ala Glu Leu Ala Arg Tyr He Asp Trp Thr Pro Phe Phe Gin Thr Trp 
980 985 990 

gag ctg aag ggg gta ttc ccg aaa ate etc gat gac gaa cgc cag ggg 
Glu Leu Lys Gly Val Phe Pro Lys He Leu Asp Asp Glu Arg Gin Gly 
995 1000 1005 

get gec get cgc cag etc ttc gag gat gcg cag gcg atg gtc gaa aag 
Ala Ala Ala Arg Gin Leu Phe Glu Asp Ala Gin Ala Met Val Glu Lys 
1010 1015 1020 

ate gtg gee gag gca tgg ttc gec ccg aag gee gtg ate ggc ttc tgg 



2928 



2976 



3024 



3072 



3120 



WO 03/087386 



74 



PCT/EP03/04010 



lie Val Ala Glu Ala Trp Phe Ala Pro Lys Ala Val He Gly Phe Trp 
1025 1030 1035 1040 



ccg gcc gcc age atg ggc gac gac gtc cgc ctg ttt gec gac gag gtg 
Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Asp Glu Val 
1045 1050 1055 



3168 



cgc gaa gcc gag ctt gcc acc ttc ttc acg etc cgc cag cag atg gtg 
Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Met Val 
1060 1065 1070 



3216 



aag cgc gac ggc egg ccg aac gtc gcc ctt gcc gac ttc gtc gcc ccg 
Lys Arg Asp Gly Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro 
1075 1080 1085 



3264 



gcg gcg age ggc aag egg gac tat gtc ggc ggt ttc gtg gtg acg gcc 
Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala 
1090 1095 1100 



3312 



ggc ate gag gaa gtg gcg ate gcc gaa cgc ttc gaa egg gcg aac gac 
Gly He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp 
H05 1110 1115 1120 



3360 



gat tat tec teg ate atg gtc aag gcg ctt gcg gac cgc ttc gca gag 
Asp Tyr Ser Ser He Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu 
1125 1130 1135 



3408 



gcc ttt gcc gag cgc atg cat gaa tat gtc cgc aag gag etc tgg ggc 
Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly 
1140 1145 1150 



3456 



tat get ccg gac gaa gcc ttc acg ccg cag gaa ttg ate gcc gag ccc 
Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gin Glu Leu He Ala Glu Pro 
1155 1160 1165 



3504 



tat gcc ggc ate cgc cct gcg ccc ggc tac ccg gcg cag ccc gac cac 
Tyr Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His 
1170 1175 1180 



3552 



acg gaa aag gag acg ctt ttc egg etc ctg gat gcg gaa gcc get ate 3600 
Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala He 
1185 1190 1195 1200 



ggc gtc egg etc acc gag age tat gcg atg tgg ccg ggc tct teg gta 
Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val 
1205 1210 1215 



3648 



teg ggc etc tat gtc ggc cac ccc gat tec tat tac ttc ggc gtc gca 
Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala 
1220 1225 1230 



3696 



aag ate gag cgc gat cag gtg gag gac tat gcc gat cgc aag cgc atg 
Lys He Glu Arg Asp Gin Val Glu Asp Tyr Ala Asp Arg Lys Arg Met 
1235 1240 1245 



3744 



age gtc cgc gag gtc gag cgc tgg ctt teg ccg ate etc aat tac gtg 
Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Val 
1250 1255 1260 



3792 



ccg atg ccg gag acg gaa gcg gcg gag tag 
Pro Met Pro Glu Thr Glu Ala Ala Glu 
1265 1270 



3822 
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<210> 18 
<211> 1273 
<212> PRT 

<213> Sinorhizobium meliloti 
<400> 18 

Val Ser Lye Ser He He Leu Cys Arg Phe Gin Asn Gly Arg Ser Pro 
1 5 10 ( is 

Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp 
20 25 30 

Gly Ser Glu Val Phe Arg Gin Leu Ala Gin Ala Ala Ala Glu Arg He 
35 40 45 

Leu lie Met Asp Gly Ala Met Gly Thr Glu He Gin Gin Leu Gly Phe 
50 55 60 

Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His 
65 70 7 5 BO 

Gin Gin Gly Asn Asn Asp Leu Leu Thr Leu Thr Gin Pro Lys Ala He 
85 90 95 

Glu Asp He His Tyr His Tyr Ala He Ala Gly Ala Asp He Leu Glu 
100 105 no 

Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Gly Met 
115 120 125 

Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg 
130 135 140 

Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val 
i45 ISO 155 leo 

Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
165 170 175 

Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Leu 
180 185 190 

Ala Tyr Ala Glu Gin Val Arg Gly Leu He Asp Gly Gly Ala Asp He 
135 200 205 

He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
210 215 220 

Phe Ala Thr Gin Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val 
225 230 235 " 240 

Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
245 250 ~ 255 

Gin Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe 
260 265 270 

Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His 
275 280 285 
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lie Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro 
290 295 300 

Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu 
305 310 315 * 320 

Gin Met Ala Ala Gin Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn 
325 330 * 335 

He Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His He Arg Ala He 
340 345 350 

Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu lie Asp 
355 360 365 

Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu 
370 375 380 

He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
385 390 395 * 400 

Lys Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp 
405 410 415 

Val Ala Arg Asp Gin Val Ala Asn Gly Ala Gin He He Asp Val Asn 
420 425 430 

Met Asp Glu Gly Leu He Asp Ser Lys Gin Val Met Val Glu Phe Leu 
435 440 445 

Asn Leu Val Ala Ser Glu Pro Asp He Ala Arg Val Pro Val Met He 
450 455 460 

Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cys Val Gin 
465 470 475 * 480 

Gly Lys Ala Leu Val Asn Ser He Ser Leu Lys Glu Gly Glu Ala Ala 
485 490 495 

Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val 
500 505 *" 510 

Val Met Ala Phe Asp Glu Lys Gly Gin Ala Asp Thr Lys Thr Arg Lys 
515 520 525 

Val Glu He Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly 
530 535 540 

Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala 
545 550 555 560 

Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
565 570 575 

Thr His Glu He He Ala Ala Leu Pro HiB Val His Val Ser Gly Gly 
580 585 590 

Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
595 600 605 

Ala Met His Ala He Phe Leu Tyr His Ala He Gin Ala Gly Met Asp 
610 615 620 
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Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Ala lie Asp 
625 «0 635 640 

Pro Glu Leu Arg Glu Thr Cye Glu Asp Val Val Leu Asn Arg Arg Ala 
645 650 555 

Asp Ser Thr Glu Arg Leu Leu Glu He Ala Glu Arg Tyr Arg Gly Lys 
660 665 670 

Gly Gly Ser Gin Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro 
675 680 685 

Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe 
690 695 700 

lie Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu 
705 710 715 720 

His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
725 730 735 

Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
740 745 750 

Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro His Met Glu Glu 
7S5 760 765 

Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys 
770 775 780 

He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn 
785 790 795 800 

lie Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu 
805 810 815 

Gly Val Met Val Pro Ser Ala Lys lie Leu Glu Val Ala Arg Glu Gin 
820 825 830 

Lys Val Asp He Val Gly Leu Ser Gly Leu lie Thr Pro Ser Leu Asp 
835 840 845 

Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val 
850 855 860 

Pro Leu Leu lie Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val 
865 8 70 875 880 

Lys He Asn Pro Arg Tyr Ser Leu Gly Gin Thr Val Tyr Val Thr Asp 
885 890 895 

Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val 
900 905 910 

Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala 
915 920 925 

Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser 
930 935 940 

Gin Ala Arg Ala Asn Ala Phe Arg He Asp Trp Asp Ala His Gin Pro 
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945 950 955 960 

Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu 
965 970 975 

Ala Glu Leu Ala Arg Tyr lie Asp Trp Thr Pro Phe Phe Gin Thr Trp 
980 985 990 

Glu Leu Lys Gly Val Phe Pro Lys He Leu Asp Asp Glu Arg Gin Gly 
995 ' 1000 1005 

Ala Ala Ala Arg Gin Leu Phe Glu Asp Ala Gin Ala Met Val Glu Lys 
1010 1015 1020 

He Val Ala Glu Ala Trp Phe Ala Pro Lys Ala Val He Gly Phe Trp 
102 5 1030 1035 1040 

Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Asp Glu Val 
1045 1050 1055 

Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Met Val 
1060 1065 ** 1070 

Lye Arg Asp Gly Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro 
1075 1080 1085 

Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala 
1090 1095 iioo 

Gly He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp 
1105 HIO 1115 ~ 1120 

Asp Tyr Ser Ser He Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu 
1125 H30 H35 

Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly 
H40 H45 H50 

Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gin Glu Leu He Ala Glu Pro 
1155 H60 lies 

Tyr Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His 
1170 1175 iibo 

Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala He 
1185 H90 U95 1200 

Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val 
1205 1210 * 1215 

Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala 
1220 1225 1230 

Lys He Glu Arg Asp Gin Val Glu Asp Tyr Ala Asp Arg Lys Arg Met 
1235 1240 1245 

Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Val 
1250 1255 1260 

Pro Met Pro Glu Thr Glu Ala Ala Glu 
1265 1270 
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<210> 19 
<211> 3684 

<212> DNA ^ 
<2l3> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (3681) 

<223> REC03905 

<400> 19 

gtg age age aaa gtg gaa caa ctg cgt gcg cag tta aat gaa cgt att 48 

Val Ser Ser Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He 
1 5 10 15 

ctg gtg ctg gac ggc ggt atg ggc acc atg ate cag agt tat cga ctg 96 
Leu Val Leu Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu 
20 25 30 

aac gaa gec gat ttt cgt ggt gaa cgc ttt gee gac tgg cca tgc gac 144 
Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp 
35 40 45 

etc aaa ggc aac aac gac ctg ctg gta etc agt aaa ccg gaa gtg ate 192 
Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He 
50 55 60 

gee get ate cac aac gee tac ttt gaa gcg ggc gcg gat ate ate gaa 240 
Ala Ala He His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu 
65 70 75 80 

acc aac acc ttc aac tec acg acc att gcg atg gcg gat tac cag atg 288 
Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Gin Met 
85 90 95 

gaa tec ctg teg gcg gaa ate aac ttt gcg gcg gcg aaa ctg gcg cga 33 6 
Glu Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala Arg 
100 105 no 

get tgt get gac gag tgg acc gcg cgc acg cca gag aaa ccg cgc tac 384 
Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr 
115 120 125 

gtt gee ggt gtt etc ggc ccg acc aac cgc acg gcg tct att tct ccg 432 
Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro 
130 135 140 

gac gtc aac gat ccg gca ttt cgt aat ate act ttt gac ggg ctg gtg 480 
Asp Val Asn Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gly Leu Val 
145 150 155 160 

gcg get tat cga gag tec acc aaa gcg ctg gtg gaa ggt ggc gcg gat 528 
Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp 
165 170 175 

ctg ate ctg att gaa acc gtt ttc gac acc ctt aac gee aaa gcg gcg 576 
Leu lie Leu lie Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala 
180 185 190 

gta ttt gcg gtg aaa acg gag ttt gaa gcg ctg ggc gtt gag ctg ccg 624 
Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro 
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195 200 205 

att atg ate tec ggc ace ate ace gac gec tec ggg cgc acg etc tec 672 
lie Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser 
210 215 220 

ggg cag ace acc gaa gca ttt tac aac tea ttg cgc cac gec gaa get 720 
Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala 
225 230 235 240 

ctg acc ttt ggc ctg aac tgt gcg ctg ggg ccc gat gaa ctg cgc cag 768 
Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin 
245 250 255 

tac gtg cag gag ctg tea egg att gcg gaa tgc tac gtc acc gcg cac 816 
Tyr Val Gin Glu Leu Ser Arg He Ala Glu CyB Tyr Val Thr Ala His 
260 265 270 

ccg aac gee ggg eta ccc aac gee ttt ggt gag tac gat etc gac gec 864 
Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala 
275 280 285 

gac acg atg gca aaa cag ata cgt gaa tgg gcg caa gcg ggt ttt etc 912 
Asp Thr Met Ala Lys Gin He Arg Glu Trp Ala Gin Ala Gly Phe Leu 
290 295 300 

aat ate gtc ggc ggc tgc tgt ggc acc acg cca caa cat att gca gcg 960 
Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gin His He Ala Ala 
305 310 315 320 

atg agt cgt gca gta gaa gga tta gcg ccg cgc aaa ctg ccg gaa att 1008 
Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu He 
325 330 335 

ccc gta gee tgc cgt ttg tec ggc ctg gag ccg ctg aac att ggc gaa 1056 
Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Glu 
340 345 350 

gat age ctg ttt gtg aac gtg ggt gaa cgc acc aac gtc acc ggt tec 1104 
Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
355 360 365 

get aag ttc aag cgc ctg ate aaa gaa gag aaa tac age gag gcg ctg 1152 
Ala Lys Phe Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu 
370 375 380 



gat gtc gcg cgt caa cag gtg gaa aac ggc gcg cag att ate gat ate 
Asp Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin He He Asp He 
385 390 395 400 



1200 



aac atg gat gaa ggg atg etc gat gee gaa gcg gcg atg gtg cgt ttt 1248 

Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe 
405 410 415 

etc aat ctg att gec ggt gaa ccg gat ate get cgc gtg ccg att atg 1296 

Leu Asn Leu He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met 
420 425 " 430 

ate gac tec tea aaa tgg gac gtc att gaa aaa ggt ctg aag tgt ate 1344 

lie Asp Ser Ser Lys Trp Asp Val lie Glu Lys Gly Leu Lys Cys He 
435 440 445 
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cag ggc aaa ggc att gtt aac tct ate teg atg aaa gag ggc gtc gat 1392 
Gin Gly Lys Gly lie Val Asn Ser He Ser Met Lys Glu Gly Val Asp 
450 455 460 

gec'ttt ate cat cac gcg aaa ttg ttg cgt cgc tac ggt gcg gca gtg 1440 
Ala Phe He His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val 
465 470 475 480 

gtg gta atg gee ttt gac gaa cag gga cag gee gat act cgc gca egg 1488 
Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg 
485 490 495 

aaa ate gag att tgc cgt egg gcg tac aaa ate etc acc gaa gag gtt 1536 
Lys He Glu He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
500 505 510 

ggc ttc ccg cca gaa gat ate ate ttc gac cca aac ate ttc gcg gtc 1584 
Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val 
515 520 525 

gca act ggc att gaa gag cac aac aac tac gcg cag gac ttt ate ggc 1632 
Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly 
530 535 540 

gcg tgt gaa gac ate aaa cgc gaa ctg ccg cac gcg ctg att tec ggc 1680 
Ala Cys Glu Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly 
545 550 555 560 

ggc gta tct aac gtt tct ttc teg ttc cgt ggc aac gat ccg gtg cgc 1728 
Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg 
565 570 575 

gaa gee att cac gca gtg ttc etc tac tac get att cgc aat ggc atg 1776 
Glu Ala He His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met 
580 585 590 

gat atg ggg ate gtc aac gee ggg caa ctg gcg att tac gac gac eta 1824 
Asp Met Gly He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asp Leu 
595 600 605 

ccc get gaa ctg cgc gac gcg gtg gaa gat gtg att ctt aat cgt cgc 1872 
Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg 
610 615 620 

gac gat ggc acc gag cgt tta ctg gag ctt gee gag aaa tat cgc ggc 1920 
Asp Abp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly 
625 630 635 " 640 

age aaa acc gac gac acc gee aac gee cag cag gcg gag tgg cgc teg 1968 
Ser Lys Thr Asp Asp Thr Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser 
645 650 655 

tgg gaa gtg aat aaa cgt ctg gaa tac teg ctg gtc aaa ggc att acc 2016 
Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr 
660 665 670 

gag ttt ate gag cag gat acc gaa gaa gee cgc cag cag get acg cgc 2064 
Glu Phe He Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Thr Arg 
675 680 685 

ccg att gaa gtg att gaa ggc ccg ttg atg gac ggc atg aat gtg gtc 2112 
Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val 
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690 695 700 

ggc gac ctg ttt ggc gaa ggg aaa atg ttc ctg cca cag gtg gtc aaa 2160 
Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lye 
705 710 715 720 

teg gcg cgc gtc atg aaa cag gcg gtg gec tac etc gaa ccg ttt att 2208 
Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe lie 
725 730 735 

gaa gec age aaa gag cag ggc aaa ace aac ggc aag atg gtg ate gee 2256 
Glu Ala Ser Lys Glu Gin Gly Lye Thr Aen Gly Lys Met Val He Ala 
740 745 750 



acc gtg aag ggc gac gtc cac gac ate ggt aaa aat ate gtt ggt gtg 
Thr val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val 
755 760 765 



2304 



gtg ctg caa tgt aac aac tac gaa att gtc gat etc ggc gtt atg gtg 2352 
Val Leu Gin Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val 
770 775 780 

cct gcg gaa aaa att etc cgt acc get aaa gaa gtg aat get gat ctg 2400 
Pro Ala Glu Lys He Leu Arg Thr Ala Lys Glu Val Asn Ala Asp Leu 
785 790 795 800 

att ggc ctt teg ggg ctt ate acg ccg teg ctg gac gag atg gtt aac 2448 
He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Aan 
80S 810 815 

gtg gcg aaa gag atg gag cgt cag ggc ttc act att ccg tta ctg att 2496 
Val Ala Lys Glu Met Glu Arg Gin Gly Phe Thr lie Pro Leu Leu He 
820 825 830 

ggc ggc gcg acg acc tea aaa gcg cac acg gcg gtg aaa ate gag cag 2544 
Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin 
835 840 845 

aac tac age ggc ccg acg gtg tat gtg cag aat gee teg cgt acc gtt 2592 
Asn Tyr Ser Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val 
850 , 855 860 

ggt gtg gtg gcg gcg ctg ctt tec gat acc cag cgt gat gat ttt gtc 2640 
Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val 
865 870 875 880 

get cgt acc cgc aag gag tac gaa acc gta cgt att cag cac ggg cgc 2688 
Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Gly Arg 
885 890 895 

aag aaa ccg cgc aca cca ccg gtc acg ctg gaa gcg gcg cgc gat aac 2736 
Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn 
900 905 910 

gat ttc get ttt gac tgg cag get tac acg ccg ccg gtg gcg cac cgt 2784 
Asp Phe Ala Phe Asp Trp Gin Ala Tyr Thr Pro Pro Val Ala His Arg 
915 920 925 

etc ggc gtg cag gaa gtc gaa gec age ate gaa acg ctg cgt aat tac 2832 
Leu Gly Val Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr 
930 935 940 
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ate gac tgg aca ccg ttc ttt atg acc tgg teg ctg gec ggg aag tat 2880 

He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr 
945 950 955 960 

ccg' cgc att ctg gaa gat gaa gtg gtg ggc gtt gag gcg cag egg ctg 2928 
Pro Arg He Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu 
965 970 975 

ttt aaa gac gec aac gac atg ctg gat aaa tta age gee gag aaa acg 2976 
Phe Lys Asp Ala Asn Asp Met Leu Asp, Lys Leu Ser Ala Gl\j Lys Thr 
980 985 9*0 

ctg aat ccg cgt ggc gtg gtg ggc ctg ttc ccg gca aac cgt gtg ggc 3024 
Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly 
995 1000 1005 

gat gac att gaa ate tac cgt gac gaa acg cgt acc cat gtg ate aac 3072 
Asp Asp He Glu He Tyr Arg Asp Glu Thr Arg Thr His Val He Asn 
1010 1015 1020 

gtc age cac cat ctg cgt caa cag acc gaa aaa aca ggc ttc get aac 3120 
Val Ser His His Leu Arg Gin Gin Thr Glu Lys Thr Gly Phe Ala Asn 
1025 1030 1035 * 1040 

tac tgt etc get gac ttc gtt gcg ccg aag ctt tct ggt aaa gca gat 3168 
Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp 
1045 1050 1055 

tac ate ggc gca ttt gec gtg act ggc ggg ctg gaa gag gac gca ctg 3216 
Tyr He Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu 
1060 1065 1070 

get gat gee ttt gaa gcg cag cac gat gat tac aac aaa ate atg gtg 3264 
Ala Asp Ala Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val 
1075 1080 1085 

aaa gcg ctt gec gac cgt tta gec gaa gec ttt gcg gag tat etc cat 3312 
Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His 
1090 1095 1100 

gag cgt gtg cgt aaa gtc tac tgg ggc tat gcg ccg aac gag aac etc 3360 
Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu 
H05 1110 ins 1120 

age aac gaa gag ctg ate cgc gaa aac tac cag ggc ate cgt ccg gca 3408 
Ser Asn Glu Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala 
1125 1130 H35 

ccg ggc tat ccg gee tgc ccg gaa cat acg gaa aaa gee acc ate tgg 3456 
Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr He Trp 
1140 1145 1150 

gag ctg ctg gaa gtg gaa aaa cac act ggc atg aaa etc aca gaa tct 3504 
Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser 
1155 1160 1165 

ttc gee atg tgg ccc ggt gca teg gtt teg ggt tgg tac ttc age cac 3552 
Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His 
1170 1175 1180 

ccg gac age aag tac tac get gta gca caa att cag cgc gat cag gtt 3600 
Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gin He Gin Arg Asp Gin Val 
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118 5 1190 H95 1200 

gaa gat tat gcc cgc cgt aaa ggt atg age gtt acc gaa gtt gag cgc 3648 
Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg 
1205 1210 1215 

tgg ctg gca ccg aat ctg ggg tat gac gcg gac tga 3684 
Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp 
1220 1225 



<210> 20 
<211> 1227 
<212> PRT 

<213> Escherichia coli 
<400> 20 

Val Ser Ser Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg lie 
1 5 io 15 

Leu Val Leu Asp Gly Gly Met Gly Thr Met lie Gin Ser Tyr Arg Leu 
20 25 30 

Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp 
35 40 45 

Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He 
50 55 60 

Ala Ala He His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu 
fi 5 70 75 60 

Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala ABp Tyr Gin Met 
85 90 95 

Glu Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala Arg 
100 105 no 

Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr 
115 120 125 

Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser lie Ser Pro 
130 135 140 

Asp Val Asn Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gly Leu Val 
I 45 150 155 160 

Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp 
165 170 175 

Leu He Leu He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala 
180 185 190 

Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro 
135 200 205 

He Met He Ser Gly Thr lie Thr Asp Ala Ser Gly Arg Thr Leu Ser 
210 215 220 

Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala 
225 230 235 240 



WO 03/087386 PCT/EP03/0401O 

85 

Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin 
245 250 255 

Tyr Val Gin Glu Leu Ser Arg lie Ala Glu Cys Tyr Val Thr Ala His 
260 265 270 " . 

Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala 
275 280 285 

Asp Thr Met Ala Lys Gin lie Arg Glu Trp Ala Gin Ala Gly Phe Leu 
290 295 300 

Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gin His lie Ala Ala 
305 310 315 320 

Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu He 
325 330 335 

Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Glu 
340 345 350 

Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
355 360 365 

Ala Lys Phe Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu 
370 375 380 

Asp Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin He He Asp He 
385 390 395 400 

Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe 
405 410 415 

Leu Asn Leu He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met 
420 425 430 

He Asp Ser Ser Lys Trp Asp Val He Glu Lys Gly Leu Lys Cys He 
435 440 445 

Gin Gly Lys Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Asp 
450 455 460 

Ala Phe He His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val 
465 470 475 480 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg 
485 490 495 

Lys He Glu He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
500 505 sio 

Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val 
515 520 525 

Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly 
530 535 540 

Ala Cys Glu Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly 
545 550 555 560 

Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg 
565 570 575 
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Glu Ala lie His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met 
580 585 590 

Asp Met Gly He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Abp Leu 
595 600 60S 

Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg 
610 615 620 

Asp Asp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly 
625 630 635 ' 640 

Ser Lys Thr Asp Asp Thr Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser 
645 650 655 

Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr 
660 665 670 

Glu Phe He Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Thr Arg 
675 680 685 

Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val 
690 695 700 

Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys 
705 710 715 720 

Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He 
725 730 735 

Glu Ala Ser Lys Glu Gin Gly Lys Thr Asn Gly Lys Met Val He Ala 
740 745 750 

Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val 
755 760 765 

Val Leu Gin Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val 
770 775 780 

Pro Ala Glu Lys He Leu Arg Thr Ala Lys Glu Val Asn Ala Asp Leu 
785 790 795 800 

He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn 
805 810 815 

Val Ala Lys Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He 
820 825 830 

Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin 
835 840 B45 

Asn Tyr Ser Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val 
850 855 860 

Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val 
fi 65 870 875 880 

Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Gly Arg 
885 890 895 

Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn 
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900 905 9io 

Asp Phe Ala Phe Asp Trp Gin Ala Tyr Thr Pro Pro Val Ala His Arg 
915 920 925 

Leu Gly Val Gin Glu Val Glu Ala Ser lie Glu Thr Leu Arg Asn Tyr 
930 935 940 

He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lya Tyr 
»45 950 955 960 

Pro Arg He Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu 
965 970 975 

Phe Lys Asp Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Thr 
980 985 990 

Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly 
995 1000 1005 

Asp Asp He Glu He Tyr Arg Asp Glu Thr Arg Thr His Val He Asn 
1010 1015 1020 

Val Ser His His Leu Arg Gin Gin Thr Glu Lys Thr Gly Phe Ala Asn 
1025 1030 1035 1040 

Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys ' Ala Asp 
1045 1050 * ,1055 

Tyr He Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu 
1060 1065 1070 

Ala Asp Ala Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val 
1075 1080 1085 

Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His 
1090 1095 1100 

Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu 
1105 llio ins 1120 

Ser Asn Glu Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala 
1125 1130 H35 

Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr He Trp 
1140 H45 1150 

Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser 
1155 1160 1165 

Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His 
1170 H75 1180 

Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gin He Gin Arg Asp Gin Val 
HAS 1190 H95 1200 

Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg 
1205 1210 1215 

Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp 
1220 1225 
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<210> 21 
<211> 3771 
<212> DMA 

<213> Salmonella typhimuriuro 

<220> 
<221> CDS 
<222> (1)..(3768) 
<223> RSY02996 

<400> 21 

atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac get ttg tgc 48 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 10 15 

cag tat ggc teg tta cgt gga gcg ttg teg gga gcg agt gtg age age 96 
Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg 144 
Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg lie Leu Val Leu 
35 40 45 

gac ggc ggt atg ggc act atg ate cag age tat cgt eta cat gaa gaa 192 
Asp Gly Gly Met Gly Thr Met lie Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc 240 
Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

aac aat gac ctg ctg gtc etc age aag ccg gag gtg ate gcc get ate 288 
Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

cac aac gcc tac ttt gag get ggc gcg gat ate ate gaa acc aac ace 336 
His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 110 

ttt aac teg aca acc att gcg atg gcg gat tac egg atg gaa tec ctg 384 
Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 

teg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc 432 
Ser Ala Glu He Asn Tyr Ala Ala Ala Lye Leu Ala Arg Ala Cys Ala 
130 135 140 

gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc 480 
Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

gtg ctt ggt cca act aac cgc acg gcc tec att teg ccg gac gtc aac 528 
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

gac ccg gcg ttt cgt aat ate acc ttc gat cag ctg gtg gcg gcc tac 576 
Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

cgt gaa tec acc aaa gcg ctg gtg gaa ggt ggc gca gat ctg att ctg 624 
Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
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195 200 205 

att gag acc gtt ttt gac acc ctg aat gcg aaa gcg gcg gtg ttt gcg 672 
lie Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gac ctg ccg ate atg att 720 
Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro lie Met lie 
22 5 230 235 240 

tec ggc acc ate acc gac gec tct ggc cgt acg ctt tec ggc cag act 768 
Ser Gly Thr lie Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

acc gaa gec ttt tat aac teg ctg cgc cac gec gag gcg etc act ttt 816 
Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

ggc ctt aac tgc gca ctg ggg cca gat gaa ctg cgc cag tac gtc cag 864 
Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

gaa ctg teg egg att gec gaa tgc tac gtc acc gcg cac ccg aac gec 912 
Glu Leu Ser Arg lie Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

ggc ctg ccg aac get ttc ggc gag tat gac etc gac gee gac acc atg 960 
Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat ate gtt 1008 
Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg age cgc 1056 
Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

gee gtt gee ggt ttg ctg ccg cgc cag ctg ccg gat ate ccg gtc gec 1104 
Ala Val Ala Gly Leu Leu Pro Arg Gin Leu Pro Asp He Pro Val Ala * 
355 360 365 

tgc cgc ctt tec ggc ctg gag ccg ctg aac att ggc gac gat age ctg 1152 
Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 



ttt gtg aac gtc ggc gaa cgt act aac gtc acc ggc teg gec aaa ttt 
Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
385 390 395 



1200 



400 



aaa egg ctg ate aaa gaa gag aaa tac age gaa gcg ctg gat gtc gec 1248 
Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

cgt cag cag gta gaa age ggc gcg cag att att gat ate aat atg gat 1296 
Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

gag ggg atg etc gac gee gaa gcg gcg atg gtg cgt ttc etc age ctg 1344 
Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 
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att gcc ggt gag ccg gac att gcc cgt gta cca ate atg ate gac tec 1392 

He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met lie Asp Ser 
450 455 460 

tec aaa tgg gag gtt ate gaa aaa ggg ctg aag tgc att cag ggt aaa 1440 
Ser LyB Trp Glu Val He Glu Lye Gly Leu Lye Cys He Gin Gly Lys 

465 470 475 480 

ggq ate gtc aac tct att teg atg aaa gag ggc gtg gaa gcc ttt att 1488 

Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
465 490 495 

cat cat gcg aag ctt ctg cgt cgc tac ggc gcg gca gtg gtg gtg atg 1536 

His His Ala LyB Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

gcc ttt gat gag cag ggg cag gcc gat ace cgc gcg cgt aaa ate gaa 1584 

Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg Lys He Glu 
515 520 525 

att tgc cgc cga gcc tac aaa att etc ace gaa gag gtg ggt ttc ccg 1632 

He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val Gly Phe Pro 
530 535 540 

ccg gaa gac ate ate ttc gac ccg aat ate ttc gcc gtg gcg ace ggt 1680 
Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 

545 550 555 560 

att gaa gag cac aac aac tac gcg cag gac ttt ate ggc get tgc gaa 1728 

He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala Cys Glu 
565 570 * 575 

gac ate aaa cgc gag ctg ccg cac gcg ctg ate tec ggc ggc gtg tct 1776 

Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 585 590 

aac gtg tec ttc teg ttc cgc ggc aac gac ccg gta cgt gag get ate 1824 

Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

cac gcg gta ttc etc tac tac gcc ate cgc aac ggt atg gac atg ggc 1872 

His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

ate gtc aac gcc gga cag ctg get ate tac gac gac ctg ccc gcc gag 1920 

He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asp Leu Pro Ala Glu 

625 630 635 640 

ctg cgc gat gcg gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc 1968 

Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
645 650 655 

act gag cgt ttg ctg gat ttg gcg gag aaa tac cgc ggc age aaa acc 2016 

Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 " 670 

gac gaa get gcc aac gcc cag cag gcg gaa tgg cgt age tgg gac gtg 2064 

Asp Glu Ala Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 ' 685 

aaa aag cgt etc gaa tac teg ctg gtg aaa ggc att acc gaa ttt ate 2112 

Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
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690 695 700 

gaa cag gat acc gaa gaa gcc cgt cag cag gcc gcc cgc ccg att gag 2160 
Glu Gin Asp Thr Gl\i Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
705 710 715 720 

gtg att gaa ggg ccg ttg atg gac ggc atg aac gtg gtc ggc gac ctg 2208 
Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tec get cgc 2256 
Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

gtg atg aaa caa gcg gtg gcc tac ctg gag ccg ttt att gaa gcc age 2304 
Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

aaa gaa aaa ggc tec age aac ggc aag atg gtg ate gcc acc gtg aag 2352 
Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lye 
770 775 780 

ggc gat gtg cac gat att ggt aaa aat ate gtt ggc gtg gtg ctg caa 2400 
Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val Val Leu Gin 
785 790 795 800 

tgt aac aac tac gaa ate gtc gat ctt ggc gtg atg gtg cca gcg gag 2448 
Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

aaa ate etc aga acg gcg cgt gaa gtg aat gcc gat ctg ate ggt ctt 2496 
Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

tec ggg ctg att acc ccg teg ctg gac gaa atg gtc aac gtg gcg aaa 2544 
Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

gag atg gag cgt cag ggc ttt act ate ccg eta ctg ate ggc ggc gca 2592 
Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

acc act teg aaa gcg cat acg gcg gtg aaa ate gag cag aac tac age 2640 
Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

ggt ccg acg gtc tac gtg cag aat get teg cgt acc gtg ggc gtg gtg 2688 
Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

gcg gcg eta etc tec gac acc cag cgt gat gac ttt gtc gcc cgt acc 2736 
Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 

cgc aaa gag tac gaa acc gtg cgt att cag cac gcc cgc aaa aaa ccg 2784 
Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

cgc acg ccg ccg gtc acg ctg gag gcg gcg cgc gat aac gat ctg gca 2832 
Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 
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ttc gat tgg gaa cgc tac acc ccg ccg gta gcc cac cgt ctg ggc gtg 
Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 



2880 



cag gag gtg gaa gcc age ate gaa act ctg cgc aac tat ate gac tgg 2928 
Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

acg ccg ttc ttt atg acc tgg teg ctg gcc ggc aaa tac ccg cgc att 2976 
Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

ctg gaa gat gag gtg gtg ggc gtt gag gcg cag cgt ctg ttt aaa gac 3024 
Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

gcc aat gat atg ctg gat aaa ctg age gcc gag aaa ctg ttg aat ccg 3072 
Ala Asn Asp Net Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac ate 3120 
Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

gaa ate tat cgc gac gaa acc cgt act cat gtt ctg acg gtc age cac 3166 
Glu lie Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
104S 1050 1055 

cac ctg cgc cag cag acc gag aaa gtc ggt ttt gcc aac tac tgt ctg 3216 
His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

gcg gat ttt gtc gcg ccg aaa ctg age ggc aaa gcg gat tac ate ggt 3264 
Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr He Gly 
1075 1080 1085 

get ttc gcg gtg acc ggc ggt ctg gag gag gat gcg ctg gcg gac gcc 3312 
Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala 
1090 1095 1100 

ttc gaa gcg caa cac gac gac tat aac aag ate atg gtg aaa gcg att 3360 
Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1105 1110 1115 1120 



gcc gac cgt ctg gcg gaa gcg ttc gcg gaa tat ctg cat gag cgt gta 3408 
Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 1135 

cgt aag gtt tac tgg gga tat gcg ccg aac gag age ctg agt aac gac 3456 
Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 

gaa tta ate cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat 3504 
Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 1165 

cct gcc tgc ccg gaa cat acc gaa aaa ggc act ate tgg cag eta ctg 3552 
Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 1175 1180 

gat gtc gaa aaa cac acc ggg atg aag etc acc gaa tct ttc gcc atg 3600 
Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
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1185 1190 1195 1200 

tgg cca ggc gcg teg gtc tec ggc tgg tac ttc age cat cct gag age 3648 
Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

aaa tac ttc gcg gta gcg cag ate caa cgc gat cag gtg aca gat tat 3696 
Lya Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 * 1230 

get ttc cgt aaa gga atg age gtt gag gat gtt gag egg tgg etc gcg 3744 
Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 * 1245 

ccg aac ctg ggt tac gat gcg gac tga 3771 
Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 22 
<211> 1256 
<212> PRT 

<213> Salmonella typhimurium 
<400> 22 

Met Ser His Val Ala Arg Cye Ser Leu Phe Arg Gin His Ala Leu CyB 
1 5 10 15 

Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He Leu Val Leu 
35 40 45 

Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 no 

Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 * 125 

Ser Ala Glu He Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala 
130 135 140 

Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val Ala Ala Tyr 
1B0 165 190 

Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
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195 



200 



205 



lie Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

Ala Val Ala Gly Leu Leu Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 

Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
335 390 395 400 

Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg phe Leu Ser Leu 
435 440 445 

He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met He Asp Ser 
450 455 460 

Ser LyB Trp Glu Val He Glu Lys Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
485 490 495 

His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg Lys He Glu 



515 



520 



525 



f 
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He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val Gly Phe Pro 
530 535 540 

Pro Glu Asp He He Phe Asp Pro Aen He Phe Ala Val Ala Thr Gly 
545 550 555 560 

He Glu Glu His Aen Asn Tyr Ala Gin Asp Phe lie Gly Ala Cys Glu 
565 570 575 

Asp lie Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 585 590 

Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

He Val Asn Ala Gly Gin Leu Ala He Tyr ABp Asp Leu Pro Ala Glu 
"5 630 635 640 

Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
645 650 655 

Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

Asp Glu Ala Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 685 

Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
690 695 700 

Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
705 710 715 ~ 720 

Val lie Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
770 775 780 

Gly Asp Val His Asp He Gly Lys Asn lie Val Gly Val Val Leu Gin 
785 790 795 800 

Cys Asn Asn Tyr Glu lie Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu lie Gly Leu 
820 825 830 

Ser Gly Leu lie Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

Glu Met Glu Arg Gin Gly Phe Thr lie Pro Leu Leu He Gly Gly Ala 
850 855 860 
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Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 

Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 

Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 ~ 960 

Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
1045 1050 1055 

His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr He Gly 
1075 1080 1085 

Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala 
1090 1095 1100 

Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1105 1110 1115 1120 

Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 1135 

Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 

Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 1165 

Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 1175 1180 

Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
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1185 1190 U95 1200 

Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

Lye Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 23 
<211> 3771 
<212> DNA 

<213> Salmonella typhi 

<220> 
<221> CDS 
<222> (1)..(3768) 
<223> RTY03686 

<400> 23 

atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac get ttg tgc 48 
Met Ser His Val Ala Arg Cye Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 io 15 

cag tat ggc teg tta cgt gga gcg ttg teg gga gcg agt gtg age age 9$ 
Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg 144 
Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg lie Leu Val Leu 
35 40 45 

gac ggc ggt atg ggc ace atg ate cag age tat cgt eta cat gaa gaa 192 
Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc 240 
Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

aac aat gac ctg ctg gtc etc age aag ccg gag gtg ate gcc get ate 288 
Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

cac aac gcc tac ttt gag get ggc gcg gat ate ate gaa ace aac ace 336 
His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 no 

ttt aac teg aca ace att gcg atg gcg gat tac egg atg gaa tec ctg 384 
Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 

teg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc 432 
Ser Ala Glu lie Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala cys Ala 
130 135 140 
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gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc 480 

Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

gtg ctt ggt cca act aac cgc acg gcc tec att teg ccg gac gtc aac 528 
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

gac ccg gcg ttt cgt aat ate ace ttc gat cag ctg gtg gcg gcc tac 576 
Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

cgt gaa tec ace aaa gcg ctg gtg gaa ggc ggg gcg gac ctg ate ctg 624 
Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
195 200 205 

att gaa act gtc ttc gac ace etc aac gcc aaa gcg gcg gtg ttt gcg 672 
He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gat ctg ccg ate atg att 720 
Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

tec ggc acc ate ace gac gcc tct ggc cgt acg ctt tec ggc cag acg 768 
Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

acc gaa gcc ttt tat aac teg ctg cgc cac gcc gag gcg etc act ttt 816 
Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

ggc ctt aac tgc gcg ctg ggg cca gat gaa ctg cgc cag tac gtc cag 864 
Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

gaa ctg teg egg att gcc gaa tgc tac gtc acc gcg cac ccg aac gcc 912 
Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

ggc ctg ccg aac get ttc ggc gag tac gac etc gac gcc gac acc atg 960 
Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat ate gtt 1008 
Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg age cgc 1056 
Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

gcc gtt gcc ggt ttg teg ccg cgc cag ctg ccg gat ate ccg gtg gcc 1104 
Ala Val Ala Gly Leu Ser Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

tgc cgc ctt tec ggc ctg gag ccg ctg aac att ggt gac gat age ctg 1152 
Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 



ttt gtc aac gtc ggc gaa cgt act aac gtc acc ggc teg gcc aaa ttt 
Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 



1200 
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385 390 395 400 

aaa cgc ttg ate aaa gaa gag aaa tac age gaa gcg ctg gat gtc gee 1248 
Lys Arg Leu He Lys Glu Glu Lye Tyr Ser Glu Ala Leu Asp ,Val Ala 
405 410 415 

cgt cag cag gtc gaa age ggc gcg cag att att gat ate aat atg gat 1296 
Arg Gin Gin Val Glu Ser Gly Ala Gin He lie Asp He Asn Met Asp 
420 425 430 

9*9 999 atg etc gac gee gaa gcg gcg atg £tg cgt ttc etc age ctg 1344 
Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 

att gee ggt gag ccg gac att gee cgt gta cca ate atg att gac tec 1392 
He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met He Asp Ser 
450 455 460 

tec aaa tgg gag gtt ate gaa aaa ggg ctg aag tgc att cag ggt aaa 1440 
Ser Lys Trp Glu Val He Glu LyB Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

ggc ate gtc aac tct att teg atg aaa gag ggc gtg gaa gec ttt att 1488 
Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
485 490 495 

cat cat gcg aag ttg eta cgt cgc tac ggc gee gca gtg gtg gtg atg 1536 
His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

get ttt gat gag cag ggg cag gee gac ace cgc gaa cgt aaa ate gag 1584 
Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Glu Arg Lys He Glu 
515 520 ~ 525 

att tgc cgc cgc get tac aaa att ttg etc gaa gag gta ggc ttt ccg 1632 
He Cys Arg Arg Ala Tyr Lys He Leu Leu Glu Glu Val Gly Phe Pro 
530 535 540 

ccg gaa gac ate ate ttc gac ccg aat ate ttc gee gtc gee ace ggt 1680 
Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
54 5 550 555 560 

att gaa gag cac aac aac tac gcg cag gac ttt ate ggc get tgt gaa 1728 
He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala Cys Glu 
565 570 575 

gac ate aaa cgc gag ctg ccg cac gcg ctg ate tec ggc ggc gtg tct 1776 
Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 ' 585 590 

aac gtg tec ttc teg ttt cgc ggc aac gac ccg gta cgt gag get ate 1824 
Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

cac gcg gta ttc etc tac tac gee ate cgc aac ggc atg gac atg ggc 1872 
His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

ate gtc aac gee ggg caa ctg gcg att tat gac aac ctg cct gee gaa 1920 
He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asn Leu Pro Ala Glu 
625 630 635 640 
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ctg cgc gat gca gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc 1968 
Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
645 650 " 655 

acc gag cgt ttg ctg gat ttg gcg gag aaa tat cgc ggc age aaa acc 2016 
Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

gac gag get gec agt gec cag cag gcg gaa tgg cgt age tgg gac gtg 2064 
Asp Glu Ala Ala Ser Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 685 

aaa aag cgt etc gaa tac teg ctg gtg aaa ggc att acc gag ttt ate 2112 
Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
690 695 700 

gaa cag gat acc gaa gaa gee cgt cag cag gec gec cgc ccg att gag 2160 
Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro lie Glu 
705 710 715 720 

gtg att gaa ggg ccg ctg atg gac ggc atg aac gtg gtc ggc gac ctg 2208 
Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tec get cgc 2256 
Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

gtg atg aaa caa gcg gtg gee tac ctg gag ccg ttt att gaa gee age 2304 
Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

aaa gaa aaa ggc tec age aac ggc aag atg gtg att gee acc gtg aag 2352 
Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
770 ■ 775 780 

ggc gat gtg cac gac att ggc aag aac att gtc ggc gtg gtg ctg caa 2400 
Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val Val Leu Gin 
785 790 795 800 

tgc aac aac tac gaa ate gtc gat ctt ggc gtg atg gtg cca gcg gag 2448 
Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

aaa ate etc aga acg gcg cgt gaa gtg aat gee gat ctg att ggt ctt 2496 
Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

tec ggg ctt ate acc ccg teg ctg gac gaa atg gtc aac gtg gcg aaa 2544 
Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

gag atg gag cgt cag ggc ttt act ate ccg eta ctg ate ggc ggc gca 2592 
Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

acc act teg aaa gcg cat acg gcg gtg aaa ate gag cag aac tac age 2640 
Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

ggt ccg acg gtc tac gtg cag aat get teg cgt acc gtg ggc gtg gtg 2688 
Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
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885 890 695 

gcg gcg eta etc tec gac ace cag cgt gat gac ttt gtc gee cgt ace 2736 
Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 ' 

cgc aaa gag tac gaa ace gtg cgt att cag cac gec cgc aaa aaa ccg 2784 
Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

cgc acg ccg ccg gtc acg ctg gaa gcg gcg cgc gat aat gat ctg gca 2832 
Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 

ttt gat tgg gaa cgc tac ace ccg ccg gta gec cac cgt ctg ggc gtg 2880 
Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 

cag gag gtg gaa gee age ate gaa acg ctg cgc aac tac ate gac tgg 2928 
Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

acg ccg ttc ttt atg acc tgg teg ctg gee ggc aaa tac ccg cgc att 2976 
Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

ctg gaa gat gag gtg gtg ggc gtt gag gcg cag cgt ctg ttt aaa gac 3024 
Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

gec aat gat atg ctg gat aaa ctg age gec gag aaa ctg ttg aat ccg 3072 
Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac ate 3120 
Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1Q 25 1030 1035 1040 

gaa ate tat cgc gac gaa acc cgt act cat gtt ctg acg gtc age cac 3168 
Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
1045 1050 1055 

cac ctg cgc cag cag acc gag aaa gtt ggt ttt get aac tac tgt ctg 3216 
His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

gcg gat ttt gtc gcg ccg aaa ctg age ggc aaa gcg gac tac ate ggt 3264 
Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr He Gly 
1075 1080 1085 

get ttc gcg gtg acc ggc ggt ctg aag gag gat gcg ctg gcg gac gee 3312 
Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala 
1090 1095 lioo 

ttc gaa gcg caa cac gac gac tat aac aag ate atg gtg aaa gcg att 3360 
Phe Glu Ala Gin His Asp Asp Tyr Asn Lys lie Met Val Lys Ala lie 
11Q 5 1110 1115 1120 

gec gac cgt ctg gcg gaa gcg ttt gee gag tat ctg cat gag cgt gta 3408 
Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
H25 1130 H35 
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cgt aag gtt tac tgg gga tat gcg ccg aac gag age ctg agt aac gac 3456 
Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 

gaa tta ate cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat 3504 
Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 1165 

cct gec tgc ccg gaa cat acc gaa aaa ggc act ate tgg cag eta ctg 3552 
Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 1175 H80 

gat gtc gaa aaa cac acc ggg atg aag etc acc gaa tct ttc gee atg 3600 
Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
1185 1190 H95 1200 

tgg cct ggc gcg teg gtc tec ggc tgg tac ttc age cat cct gag age 3648 
Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

aaa tac ttc gcg gta gcg cag ate caa cgc gat cag gtg aca gat tat 3696 
Lys Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

get ttc cgt aaa gga atg. age gtt gag gac gtt gag egg tgg etc gcg 3744 
Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

ccg aac ctg ggt tac gat gcg gac tga 3771 
Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 24 
<211> 1256 
<212> PRT 

<213> Salmonella typhi 
<400> 24 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 10 15 

Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He Leu Val Leu 
35 40 45 

Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

His Asn Ala Tyr Phe Glu Ala Gly Ala A B p He He Glu Thr Asn Thr 
100 105 no 

Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 
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Ser Ala Glu He Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala 
130 135 140 

Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
1« 150 155 , 160 

Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val fdi Ala Tyr 
180 185 190 

Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
195 200 205 

He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 " 315 320 

Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

Ala Val Ala Gly Leu Ser Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 

Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
385 390 395 400 

Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 

He Ala Gly Glu Pro Asp lie Ala Arg Val Pro He Met He Asp Ser 
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450 455 460 

Ser Lys Trp Glu Val He Glu Lys Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
485 490 495 

His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Glu Arg Lys He Glu 
515 520 . 525 

He Cys Arg Arg Ala Tyr Lys He Leu Leu Glu Glu Val Gly Phe Pro 
530 535 540 

Pro Glu Asp lie He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
545 550 555 560 

He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala CyB Glu 
565 570 575 

Asp lie Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 585 590 

Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

His Ala Val Phe Leu Tyr Tyr Ala lie Arg Asn Gly Met Asp Met Gly 
610 615 620 

lie Val Asn Ala Gly Gin Leu Ala lie Tyr Asp Asn Leu Pro Ala Glu 
625 630 635 640 

Leu Arg Asp Ala Val Glu Ab P Val lie Leu Asn Arg Arg Asp Asp Gly 
64 5 650 655 

Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

Asp Glu Ala Ala Ser Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 685 

Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly lie Thr Glu Phe lie 
690 695 700 

Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro lie Glu 
70S 710 715 720 

Val lie Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
7 40 745 750 

Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe lie Glu Ala Ser 
755 760 765 

Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val lie Ala Thr Val Lys 
770 775 780 
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Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val Val Leu Gin 
785 ? 90 795 800 

Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 *815 

Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn VaJ. Ala Lys 
835 840 845 

Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

Thr Thr Ser Lys Ala His Thr Ala Val Lys lie Glu Gin Asn Tyr Ser 
865 870 875 680 

Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 

Arg Lys Glu Tyr Glu Thr Val Arg lie Gin His Ala Arg Lys Lys Pro 
515 920 925 

Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 

Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 

Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
!045 1050 1055 

His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr lie Gly 
!075 1080 1085 

Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala 
1090 1095 iioo 

Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala lie 
1105 mo ins ' 1120 
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Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 ' H35 

Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 H50 

Glu Leu lie Arg Glu Asn Tyr Gin Gly lie Arg Pro Ala Pro Gly Tyr 
1155 1160 * H65 

Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 H75 1180 

Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
1185 1190 1195 1200 

Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

Lys Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 25 
<211> 3711 
<212> DNA 

<213> Pseudomonas fluorescens 

<220> 

<221> CDS 

<222> (1) . . (3708) 

<223> RPU03563 

<400> 25 

atg tec gat cgc age gtc cgc ctt caa get etc aag caa get etc aaa 48 

Met Ser Asp Arg ser Val Arg Leu Gin Ala Leu Lys Gin Ala Leu Lys 
1 5 10 is 



gag cgc ate ctg att etc gac ggc ggc atg ggc acg atg ate cag age 
Glu Arg He Leu He Leu Asp Gly Gly Met Gly Thr Met He Gin Ser 
20 25 30 



96 



tac aag etc gaa gag cag gat tat cgc ggc aaa cgc ttc gec gac tgg 144 
Tyr Lys Leu Glu Glu Gin Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp 
35 40 " 45 

ccg age gac gtc aag ggc aac aac gac ctg ttg gtg ctg acc cgc ccg 192 
Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Arg Pro 
50 55 60 



gac gtg ate ggc ggc ate gag aaa gee tat ctg gat gee ggt gee gac 
Asp Val He Gly Gly He Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp 
fi 5 70 75 80 



ate etc gag acc aac acc ttc aac gee acg cag att tec atg gee gac 
He Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin He Ser Met Ala Asp 



240 



288 
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65 90 95 

tac ggc atg gaa gaa ctg gtc tac gaa etc aac gta gaa ggc gec cgt 336 
Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly ,Ala Arg 
100 105 110 

ctg gca cgc aag gtc gec gac gcg aaa acc etc gag acc ccc gac aag 384 
Leu Ala Arg Lys Val Ala Asp Ala Lys Thr Leu Glu Thr Pro Asp Lys 
115 120 125 

ccg cgc ttc gtc gee ggc gtt etc ggc ccg acc age cgc acc tgc teg 432 
Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser 
130 135 140 

ctg teg ccg gac gtc aac aac ccg ggc tat cgc aac gtc acc ttc gat 480 
Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp 
145 150 155 160 

gag ctg gtc gaa aac tac acc gag gee acc aaa ggc ctg ate gag ggc 528 
Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu He Glu Gly 
165 170 175 

ggc gcg gat ctg ate ctg ate gaa acc ate ttc gac acc etc aac gee 576 
Gly Ala Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala 
180 185 190 

aaa gee gcg ate ttc gee gtg caa ggc gtg ttc gaa gaa ctg ggc ttc 624 
Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu Glu Leu Gly Phe 
195 200 205 

gaa ttg ccg ate atg ate tec ggc acc ate acc gac gee tec ggc cgt 672 
Glu Leu Pro He Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg 
210 215 220 

acc ctg teg ggc cag acc acc gaa gcg ttc tgg aac tec gtg get cac 720 
Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His 
225 230 235 240 

gec aaa ccg att tec gtc ggt ctt aac tgc gee etc ggc gec cgc gaa 768 
Ala Lys Pro lie Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu 
245 250 255 

ctg cgt ccg tac ctg gaa gag ctg teg gac aag gec age acc cac gtt 816 
Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val 
260 265 270 

teg gcg cac ccg aac gee ggc ctg ccg aac gaa ttc ggc gag tac gac 864 
Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp 
275 280 285 

gag ctg ccg gtg gac acc gee aag gtc ate gaa gag ttc gee cag age 912 
Glu Leu Pro Val Asp Thr Ala Lys Val lie Glu Glu Phe Ala Gin Ser 
290 295 300 

ggt ttc etc aac ate gtc ggc ggt tgc tgc ggc acc acg ccg ggc cat 960 
Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His 
305 310 315 320 



ate gaa gec ate gee aaa gee gtt gec ggt tac gcg cca egg cag att 
He Glu Ala He Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gin He 
325 330 335 



1008 
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ccg gac att ccc aag gcc tgc cgc ctg teg ggt ctg gaa ccg ttc acc 1056 
Pro Asp He Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro Phe Thr 
340 345 350 

att gat cgc age teg ctg ttc gtc aac gtc ggc gag egg acc aac ate 1104 
He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn He 
355 360 ' 365 

acc ggg tec gcg aaa ttt gcc egg ctg ate cgt gaa gac aac tac acc 1152 
Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu Asp Asn Tyr Thr 
370 375 380 

gaa gcc ctg gaa gtc gcc ctg cag cag gtc gag gcc ggc gcc cag gtg 1200 
Glu Ala Leu Glu Val Ala Leu Gin Gin Val Glu Ala Gly Ala Gin Val 
3fi 5 390 395 400 

ate gac ate aac atg gac gaa ggg atg etc gat teg aag aag gcc atg 1248 
He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser Lys Lys Ala Met 
405 410 415 

gtg acc ttc etc aat ctg att gcc ggc gaa ccg gac ate tec cgc gta 1296 
Val Thr Phe Leu Asn Leu He Ala Gly Glu Pro Asp He Ser Arg Val 
420 425 430 

ccg ate atg ate gac tec teg aaa tgg gac gtg ate gaa gcc ggc etc 1344 
Pro He Met He Asp Ser Ser Lys Trp Asp Val He Glu Ala Gly Leu 
435 440 445 

aag tgc att cag ggc aag ggc ate gtc aac teg ate age atg aaa gaa 1392 
Lys Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Met Lys Glu 
450 455 460 

ggc gtc gag cag ttc ate cac cac gcc aaa ctg tgc aag cgc tat ggc 1440 
Gly Val Glu Gin Phe He His His Ala Lys Leu Cys Lys Arg Tyr Gly 
465 470 475 480 

gcc gcc gtg gtg gtg atg gcg ttc gac gaa gcc ggc cag get gac acc 1488 
Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gin Ala Asp Thr 
485 490 495 

gaa gcg cgc aag aaa gag ate tgc aaa cgc tec tac gac att ctg gtc 1536 
Glu Ala Arg Lys Lys Glu He Cys Lys Arg Ser Tyr Asp He Leu Val 
500 505 510 

aac gaa gtc ggc ttc ccg ccg gaa gac ate att ttc gac ccg aac ate 1584 
Asn Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He 
515 520 525 

ttc gcc gtg gcc acc ggc ate gaa gaa cac aac aac tac get gtg gac 1632 
Phe Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp 
530 535 540 

ttc ate aac gcc tgt gcc tac ate cgc gac gag ctg ccg tat gcc ctg 1680 
Phe He Asn Ala Cys Ala Tyr He Arg Asp Glu Leu Pro Tyr Ala Leu 
545 550 555 560 

age tec ggc ggc gtg tec aac gtg teg ttc teg ttc cgc ggc aac aac 1728 
Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn 
565 570 575 

ccg gtg cgc gag gcg ate cac teg gtg ttc ctg ctg tac gcg ate cgc 1776 
Pro Val Arg Glu Ala He His Ser Val Phe Leu Leu Tyr Ala He Arg 
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580 585 590 

gcc ggc ctg acc atg ggt ate gtc aac gcc ggt cag ctg gag ate tac 1824 
Ala Gly Leu Thr Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr 
595 600 60S 

gac cag ate ccg cag gaa ctg cgc gac gcc gtt gaa gac gtg ate etc 1872 
Asp Gin He Pro Gin Glu Leu Arg Asp Ala Val Glu Asp Val He Leu 
610 615 620 

aac cgc acg ccg gaa ggc acc gac gcc etc etc gcc ate gcc gac aag 1920 
Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala He Ala Asp Lys 
62 5 630 635 640 

tac aag ggc gac ggc age gtc aag gaa gcc gag acc gaa gaa tgg cgc 1968 
Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg 
645 650 655 

ggc tgg gac gtc aac aaa cgt ctg gaa cat gcg ctg gtc aag ggc ate 2016 
Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

acc acc cac ate gtc gaa gac acc gaa gaa tec cgt cag tec ttc gcc 2064 
Thr Thr His He Val Glu Asp Thr Glu Glu Ser Arg Gin Ser Phe Ala 
675 680 685 

cgc ccg ate gaa gtg ate gaa ggc ccg ctg atg tec ggc atg aac ate 2112 
Arg Pro He Glu Val He Glu Gly Pro Leu Met Ser Gly Met Asn He 
690 695 700 

gtc ggc gac ctg ttc ggc gcc ggc aaa atg ttc ctg ccg caa gtg gtg 2160 
Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val Val 
70S 710 715 720 

aaa tec gcc cgc gtg atg aag cag gcc gtg gcg cac ctg att ccg ttc 2206 
Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He Pro Phe 
725 730 735 

ate gaa ctg gaa aaa ggc gac aag ccg gaa gcc aag ggc aag ate ctg 2256 
He Glu Leu Glu Lys Gly Asp Lys Pro Glu Ala Lys Gly Lys He Leu 
740 745 750 

atg gcc acg gtc aaa ggc gac gtg cac gac ate ggc aag aac ate gtc 2304 
Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val 
755 760 765 

ggc gtg gtg ctg ggt tgc aac ggc tac gac ate gtc gac etc ggc gtg 2352 
Gly Val Val Leu Gly Cys Asn Gly Tyr Asp He Val Asp Leu Gly Val 
770 775 780 

atg gtg ccg gcg gag aag ate ctg cag gtg gcc aag gag cag aag tgc 2400 
Met Val Pro Ala Glu Lys He Leu Gin Val Ala Lys Glu Gin Lys Cys 
785 790 795 800 

gac ate ate ggc ctg tec ggt ctg ate acc ccg teg ctg gat gag atg 2448 
Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
80S 810 815 

gtc cat gtg gcc cgc gag atg cag cgc cag gac ttc cac ctg ccg ctg 2496 
Val His Val Ala Arg Glu Met Gin Arg Gin Asp Phe His Leu Pro Leu 
820 825 830 
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tit ti JS 2? 9 ? 9 aCC aCC tCC 339 9C 9 cac ac 9 9 C 9 9tg aag ate 2544 
Met lie Gly Gly Ala Thr Thr Ser Lys Ala Hie Thr Ala Val Lys He 
835 840 845 

gag ccc aag tac age aac gac gca gtg gtc tac gtg acc gac gec tec 2592 
Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser 
850 855 860 

cgc gee gtg ggc gtg gcg acg cag ttg ctg tec aag gaa ctg aaa gee 2640 
Arg Ala Val Gly Val Ala Thr Gin Leu Leu Ser Lys Glu Leu Lys Ala 
865 870 8 75 880 

ggt ttc gtc cag aag acc cgc gaa gag tac ate gac gtc cgc gag cgc 2688 
Gly Phe Val Gin Lys Thr Arg Glu Glu Tyr He Asp Val Arg Glu Arg 
885 890 895 

acc gee aac cgc age gee cgc acc gaa cgc ctg age tac gee gee gcg 2736 
Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala 
900 90S 910 

ate gee aag aag ccg cag ttc gac tgg gee act tac acc ccg gtc aaa 2784 
He Ala Lys Lys Pro Gin Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys 
915 920 925 

ccg acc ttc acc ggc acc cgc gtg ctg gac aac ate gac etc aac gtt 2832 
Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn He Asp Leu Asn Val 
930 935 940 

etc gee gag tac ate gac tgg acg ccg ttc ttc ate tec tgg gac ctg 2860 
Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He Ser Trp Asp Leu 
945 950 955 960 

gee ggc aag ttc ccg cgc ate etc gaa gac gaa gtg gtc ggc gaa gcg 2928 
Ala Gly Lys Phe Pro Arg He Leu Glu Asp Glu Val Val Gly Glu Ala 
965 970 975 

gcg acc gcg ctg tac aag gac get cgc gag atg ctg acc aag ctg ate 2976 
Ala Thr Ala Leu Tyr Ly B Asp Ala Arg Glu Met Leu Thr Lys Leu He 
980 985 990 

gac gag aaa ctg ate age gee cgt gcg gtg ttc ggc ttc tgg ccg gee 3024 
Asp Glu Lys Leu He Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala 
995 1000 1005 

aat cag gtg cac gac gac gat ate gag ctg tac ggc gat gac ggc aag 3072 
Asn Gin Val His Asp Asp Asp He Glu Leu Tyr Gly Asp Asp Gly Lys 
1Q 10 1015 1020 

cca atg gcg cgc ctg cat cac ctg cgc cag cag ate ate aag acc gac 3120 
Pro Met Ala Arg Leu His His Leu Arg Gin Gin He He Lys Thr Asp 
1025 1030 1035 1040 

ggc aaa ccg aac ttc tec etc gee gac ttc gtc gcg ccg aag gac age 3168 
Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Ly B Asp Ser 
10 45 1050 1055 

gaa gtg acc gac tac gtt ggt ggt ttc ate acc acc gee ggg ate ggc 3216 
Glu Val Thr Asp Tyr Val Gly Gly Phe He Thr Thr Ala Gly He Gly 
1060 1065 1070 

gec gaa gaa gtg gec aag gee tat cag gac gee ggc gac gat tac aac 3264 
Ala Glu Glu Val Ala Lys Ala Tyr Gin Asp Ala Gly Asp Asp Tyr Asn 
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1075 1080 1085 

teg ate atg gtc aag gec ctg gec gac cgt ctg gec gag gcg tgc gec 3312 
Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Cys Ala 
1090 1095 1100 

gag tgg ctg cac cag cag gtg cgc aaa gag cac tgg ggt tac gec aag 3360 
Qlu Trp Leu His Gin Gin Val Arg Lys Glu His Trp Gly Tyr Ala Lys 
H05 1110 ins H20 

gat gaa gec etc gat aac gag gcg ctg ate aaa gag cag tat tec ggc 3408 
Asp Glu Ala Leu Asp Asn Glu Ala Leu He Lys Glu Gin Tyr Ser Gly 
H25 H30 - H35 

ate cgc cct gec ccc ggc tac ccg gcg tgc ccg gat cac ace gag aag 3456 
He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys 
1140 H45 1150 

gee acc ctg ttc gee ctg etc gac cct gaa gca cag gaa atg cgc gec 3504 
Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gin Glu Met Arg Ala 
1155 1160 H65 

ggc cgc age ggt gtg ttc etc acc gag cac tac gcg atg ttc ccg gcg 3552 
Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala 
H70 H75 H80 

gca gec gtc age ggc tgg tac ttc gee cat ccg cag gcg cag tac ttc 3600 
Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe 
H85 1190 H95 1200 

gec gtg ggc aag gtc gac aag gat cag gtg cag age tac acc teg cgc 3648 
Ala Val Gly Lys Val Asp Lys Asp Gin Val Gin Ser Tyr Thr Ser Arg 
1205 1210 1215 

aaa ggc cag gaa ctg age ctg acc gag cgc tgg ctg gca ccc aat ctg 3696 
Lys Gly Gin Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu 
1220 1225 1230 

ggc tac gac aac tga 3711 
Gly Tyr Asp Asn 
1235 



<210> 26 
<211> 1236 
<212> PRT 

<213> Pseudomonas fluorescens 
<400> 26 

Met Ser Asp Arg Ser Val Arg Leu Gin Ala Leu Lys Gin Ala Leu Lys 
1 5 io 15 

Glu Arg lie Leu He Leu Asp Gly Gly Met Gly Thr Met He Gin Ser 
20 25 30 

Tyr Lys Leu Glu Glu Gin Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp 
35 40 45 

Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Arg Pro 
50 55 60 

Asp Val He Gly Gly He Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp 
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65 70 75 80 

lie Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin He Ser Met Ala Asp 
85 * 90 95 

Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly Ala Arg 
100 105 no 

Leu Ala Arg Lys Val Ala Asp Ala LyB Thr Leu Glu Thr Pro Asp Lys 
115 120 125 

Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser 
130 135 140 

Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp 
"5 150 155 160 

Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu He Glu Gly 
165 170 175 

Gly Ala Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala 
180 185 190 

Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu Glu Leu Gly Phe 
195 200 205 

Glu Leu Pro He Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg 
210 215 220 

Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His 
225 230 235 240 

Ala Lys Pro He Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu 
245 250 * 255 

Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val 
260 265 270 

Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp 
275 280 285 

Glu Leu Pro Val Asp Thr Ala Lys Val He Glu Glu Phe Ala Gin Ser 
290 295 300 

Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His 
305 310 315 320 

He Glu Ala He Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gin He 
325 330 335 

Pro Asp He Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro Phe Thr 
340 345 350 

He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn He 
355 360 365 

Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu Asp Asn Tyr Thr 
370 375 380 

Glu Ala Leu Glu Val Ala Leu Gin Gin Val Glu Ala Gly Ala Gin Val 
385 390 395 400 
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He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser Lys Lys Ala Met 
405 410 415 

Val Thr Phe Leu Asn Leu He Ala Gly Glu Pro Asp He Ser ,Arg Val 
420 425 430 

Pro He Met He Asp Ser Ser Lys Trp Asp Val He Glu Ala Gly Leu 
435 440 445 

Lys Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Met; Lys Glu 
450 455 460 

Gly Val Glu Gin Phe He His His Ala Lys Leu Cys Lys Arg Tyr Gly 
46 5 470 475 480 

Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gin Ala Asp Thr 
485 490 495 

Glu Ala Arg Lys Lys Glu He Cys Lys Arg Ser Tyr Asp He Leu Val 
500 505 510 

Asn Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He 
515 520 525 

Phe Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp 
530 535 540 

Phe He Asn Ala Cys Ala Tyr He Arg Asp Glu Leu Pro Tyr Ala Leu 
545 550 555 560 

Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn 
565 570 575 

Pro Val Arg Glu Ala He His Ser Val Phe Leu Leu Tyr Ala He Arg 
580 585 590 

Ala Gly Leu Thr Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr 
595 600 605 

Asp Gin He Pro Gin Glu Leu Arg Asp Ala Val Glu Asp Val He Leu 
610 615 620 

Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala He Ala Asp Lys 
625 630 635 640 

Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg 
645 650 6S5 

Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

Thr Thr His He Val Glu Asp Thr Glu Glu Ser Arg Gin Ser Phe Ala 
675 680 685 

Arg Pro He Glu Val He Glu Gly Pro Leu Met Ser Gly Met Asn He 
690 695 700 

Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val Val 
705 710 715 720 

Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He Pro Phe 
725 730 735 
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lie Glu Leu Glu Lys Gly Asp Lys Pro Glu Ala Lys Gly Lys lie Leu 
740 745 S 750 

Met Ala Thr Val LyB Gly Asp Val His Asp lie Gly Lys Asn lie Val 
755 760 765 

Gly Val Val Leu Gly Cys Asn Gly Tyr Asp lie Val Asp Leu Gly Val 
770 775 780 

Met Val Pro Ala Glu Lys lie Leu Gin Val Ala Lys Glu Gin Lys Cys 
7B5 790 795 800 

Asp lie He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
805 810 815 

Val His Val Ala Arg Glu Met Gin Arg Gin Asp Phe His Leu Pro Leu 
820 825 830 

Met He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 840 845 

Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser 
850 855 860 

Arg Ala Val Gly Val Ala Thr Gin Leu Leu Ser Lys Glu Leu Lys Ala 
865 870 875 880 

Gly Phe Val Gin Lys Thr Arg Glu Glu Tyr He Asp Val Arg Glu Arg 
885 890 895 

Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala 
900 905 910 

He Ala Lys Lys Pro Gin Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys 
915 920 925 

Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn He Asp Leu Asn Val 
930 935 940 

Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He Ser Trp Asp Leu 
945 950 955 960 

Ala Gly Lys Phe Pro Arg He Leu Glu Asp Glu Val Val Gly Glu Ala 
965 970 975 

Ala Thr Ala Leu Tyr Lys Asp Ala Arg Glu Met Leu Thr Lys Leu He 
980 985 990 

Asp Glu Lys Leu He Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala 
995 1000 1005 

Asn Gin Val His Asp Asp Asp He Glu Leu Tyr Gly Asp Asp Gly Lys 
1010 1015 1020 

Pro Met Ala Arg Leu His His Leu Arg Gin Gin lie lie Lys Thr Asp 
1° 25 1030 1035 * 1040 

Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Lys Asp Ser 
1045 1050 1055 

Glu Val Thr Asp Tyr Val Gly Gly Phe lie Thr Thr Ala Gly lie Gly 
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1060 1065 1070 

Ala Glu Glu Val Ala Lye Ala Tyr Gin Asp Ala Gly Asp Asp Tyr Asn 
1075 1080 1085 

Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Cys Ala 
1050 1095 1100 

Glu Trp Leu His Gin Gin Val Arg Lys Glu His Trp Gly Tyr Ala Lys 
1105 mo 1U5 1120 

Asp Glu Ala Leu Asp Asn Glu Ala Leu He Lys Glu Gin Tyr Ser Gly 
H25 H30 H35 

He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys 
H40 114S 1150 

Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gin Glu Met Arg Ala 
H55 H60 H65 

Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala 
H70 H75 1180 

Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe 
1185 H90 H95 120 o 

Ala Val Gly Lys Val Asp Lys Asp Gin Val Gin Ser Tyr Thr Ser Arg 
1205 1210 1215 

Lys Gly Gin Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu 
1220 1225 1230 

Gly Tyr Asp Asn 
1235 



<210> 27 
<211> 3705 
<212> DNA 

<213> Pseudomonas aeruginosa 

<220> 
<221> CDS 
<222> (1) . . (3702) 
<223> RPA01772 



<400> 27 

atg tec age ccg etc acc gat cgc 
Met Ser Ser Pro Leu Thr Asp Arg 
1 5 



age gec cgc ctg caa gec etc cag 48 
Ser Ala Arg Leu Gin Ala Leu Gin 
10 is 



cac gec etc agg gaa cgt ate ctg ate etc gat ggc ggc atg ggc acc 96 
His Ala Leu Arg Glu Arg He Leu He Leu Asp Gly Gly Met Gly Thr 
20 25 30 

atg ate cag age tac aag ctg gaa gag gec gac tac cgc ggc gag cgc 144 
Met He Gin Ser Tyr Lys Leu Glu Glu Ala Asp Tyr Arg Gly Glu Arg 
35 40 45 



ttc gee gac tgg ccg age gac gtg aaa ggc aac aac gac etc ttg ctg 192 
Phe Ala Asp Trp Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Leu 
50 55 60 
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ctg age cgc ccg gac gtg ate cag gec ate gag aag gee tac etc gac 240 
Leu Ser Arg Pro Asp Val lie Gin Ala lie Glu Lys Ala Tyr Leu Abd 
« 70 75 so 

gec ggc gec gac ate etc gag acc aac ace ttc aac gee acc cag gtg 286 
Ala Gly Ala Asp He Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin Val 
85 90 95 

tec cag gec gac tac ggc atg cag teg ctg gec tac gaa etc aac gtc 336 
Ser Gin Ala Asp Tyr Gly Met Gin Ser Leu Ala Tyr Glu Leu Asn Val 
100 105 no 

gaa ggg gcg cgc ctg gee cgc cag gtg gcg gac gcg aag acc gec gag 384 
Glu Gly Ala Arg Leu Ala Arg Gin Val Ala Asp Ala Lys Thr Ala Glu 
115 120 125 

acc ccg gac aag ccg cgt ttc gtc gec ggc gtg etc ggc ccg acc age 432 
Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser 
130 135 140 

cgc acc tgc teg att tec ccg gac gtg aac aac ccc ggc tac cgc aac 480 
Arg Thr Cys Ser He Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn 
145 150 155 160 

gtc acc ttc gac gaa ctg gtg gag aac tac gtc gag gcg acc cga ggc 528 
Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly 
165 170 175 

ctg ate gaa ggc ggc gee gac ctg ate ctg ate gag acc ate ttc gac 576 
Leu He Glu Gly Gly Ala Asp Leu He Leu He Glu Thr He Phe Asp 
180 185 190 

acc etc aac gee aag gcg gcg ate ttc gee gtc cag ggc gtg ttc gag 624 
Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu 
195 200 205 

gaa etc ggc gtg gag ctg ccg ate atg ate tec gga acc ate acc gac 672 
Glu Leu Gly Val Glu Leu Pro He Met He Ser Gly Thr He Thr Asp 
210 215 220 



gee tec ggc cgc acc ctg teg ggc cag acc acc gag gee ttc tgg aac 
Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn 
225 230 235 240 

teg gtg egg cat gee egg ccg ate teg gta ggc ctg aac tgc gee etc 
Ser Val Arg His Ala Arg Pro He Ser Val Gly Leu Asn Cys Ala Leu 
245 250 255 

ggc gec aag gaa ttg egg ccg tac ate gag gaa ctg teg acc aag gec 
Gly Ala Lys Glu Leu Arg Pro Tyr He Glu Glu Leu Ser Thr Ly B Ala 
260 265 270 

gac act cat gtc teg gec cac ccc aac gee ggc ctg ccg aac gec ttc 
Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe 
275 280 285 



720 



768 



816 



864 



ggc gaa tac gac gaa teg ccg gcg gaa atg gee gtg gtg gtc gag gaa 912 
Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu 
29 0 295 300 



ttc gee gee gee ggc ttc etc aat ate gtc ggc ggc tgc tgc ggc acc 



960 
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Phe Ala Ala Ala Gly Phe Leu Asn lie Val Gly Gly Cys Cys Gly Thr 
305 310 315 320 

acc ccg gcg cac ate gag gcg ate gee aag gca gtg gee aag, tac ccg 1008 
Thr Pro Ala His lie Glu Ala He Ala Lys Ala Val Ala Lye Tyr Pro 
325 330 335 

ccg egg gec ate ccg gag att ccc egg gee tgt cgc ctg tec ggc ctg 1056 
Pro Arg Ala He Pro Glu He Pro Arg Ala Cys Arg Leu Ser Gly Leu 
340 345 

gag ccg ttc acc ate gac cgc age teg ctg ttc gtc aac gtc ggc gag 1104 
Glu Pro Phe Thr He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu 
355 360 365 

cgc acc aac ate acc ggt teg gee aag ttc gee egg ctg ate cgc gag 1152 
Arg Thr Asn He Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu 
370 375 380 

gaa aac tac gcg gaa get etc gag gtc gee cag cag cag gtg gaa gee 1200 
Glu Asn Tyr Ala Glu Ala Leu Glu Val Ala Gin Gin Gin Val Glu Ala 
385 390 395 400 

ggc gee cag gtg ate gac ate aac atg gac gaa ggc atg ctg gac teg 1248 
Gly Ala Gin Val He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser 
405 410 4 15 

aag gcg gee atg gtc acc ttc etc aac ctg ate gec tec gag, ccc gac 1296 
Lys Ala Ala Met Val Thr Phe Leu Asn Leu He Ala Ser Glu Pro Asp 
4 20 425 430 

ate teg cgc gtg ccg ate atg ate gac tec tec aag tgg gaa gtg ate 1344 
He Ser Arg Val Pro He Met He Asp Ser Ser Lys Trp Glu Val He 
435 440 445 

gag gec ggc ctg aag tgc ate cag ggc aag ggc ate gtc aac teg ate 1392 
Glu Ala Gly Leu Lys Cys He Gin Gly Lys Gly He Val Asn Ser He 
450 455 460 

teg atg aag gaa ggc gtc gag gee ttc aag cac cat gee cgc ctg tgc 1440 
Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys 
465 470 475 480 

aag cgc tac ggc gee gcg gtg gtg gtg atg gee ttc gac gag gac ggc 14 88 
Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly 
485 490 495 

cag gec gac acc cag gcg cgc aag gaa gaa ate tgc aag cgc tec tac 1536 
Gin Ala Asp Thr Gin Ala Arg Lys Glu Glu He Cys Lys Arg Ser Tyr 
500 505 510 

gac ate ctg gtc gac gaa gtc ggc ttc cca ccg gaa gac ate ate ttc 1584 
Asp He Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp He He Phe 
515 520 525 

gat gcg aac ate ttc gee ate gec acc ggc ate gag gaa cac aac aac 1632 
Asp Ala Asn He Phe Ala He Ala Thr Gly He Glu Glu His Asn Asn 
530 535 540 



tac gcg gtc gat ttc ate aac gee tgc gee tac ate cgc gac aac etc 
Tyr Ala Val Asp Phe He Asn Ala Cys Ala Tyr He Arg Asp Asn Leu 
545 550 555 560 



16B0 
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ccc tac gcc ctg age teg ggc ggg gtg tec aac gtg tec ttc teg ttc 1720 
Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
565 570 575 

cgc ggc aac aac ccg gta cgc gag gcg ate cac teg gtg ttc etc tac 1776 
Arg Gly Asn Asn Pro Val Arg Glu Ala He His Ser Val Phe Leu Tyr 
580 585 590 

tac gcg ate cgc aac ggc ctg ace atg ggc ate gtc aac gcc ggc cag 1824 
Tyr Ala He Arg Asn Gly Leu Thr Met Gly lie Val Asn Ala Gly Gin 
595 600 605 

ctg gaa ate tac gac gag att ccg aaa gcg ctg cgc gac egg gtc gag 1872 
Leu Glu He Tyr Asp Glu He Pro Lys Ala Leu Arg Asp Arg Val Glu 
610 615 620 

gac gtg gtg etc aac cgc acg ccc gag gcc ace gag gcc ctg ctg gcg 1920 
Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala 
^25 630 635 640 

ate gcc gac gac tac aag ggc ggc ggc gcg gtc aag gag gcc gag gac 1968 
He Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp 
645 650 655 

gag gaa tgg cgc age tac age gtc gag aag cgc etc gag cat gcg ctg 2016 
Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu 
660 665 670 

gtc aag ggc ate acc acc tgg ate gtc gag gac ace gag gaa tgc cgc 2064 
Val Lys Gly He Thr Thr Trp He Val Glu Asp Thr Glu Glu Cys Arg 
675 680 685 

cag cag tgt gcg cgt ccc ate gag gtc ate gaa ggt ccg ctg atg tec 2112 
Gin Gin Cys Ala Arg Pro He Glu Val He Glu Gly Pro Leu Met Ser 
690 695 700 

ggg atg aac gtg gtc ggc gac ctg ttc ggc gcc ggc aag atg ttc etc 2160 
Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

ccg cag gtg gtc aag tec gcg cga gtg atg aag cag gcg gtg gcc cac 2208 
Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His 
725 730 735 

ctg att ccc ttc ate gag gcg gag aaa ggc gac aag ccg gaa gcc aag 2256 
Leu He Pro Phe He Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys 
740 745 750 

ggc aag ate ctg atg gcc acg gtg aag ggc gac gtg cac gac ate ggc 2304 
Gly Lys He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly 
755 760 * 765 

aag aac ate gtc ggc gtg gtg etc ggc tgc aac ggc tat gac gtg gtc 2352 
Lys Asn He Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val 
770 775 780 

gac etc ggc gtg atg gtg ccg gcg gag aag ate ctg cag acc gcc ate 2400 
Asp Leu Gly Val Met Val Pro Ala Glu Lys lie Leu Gin Thr Ala He 
785 790 795 800 

gcc gag aaa tgc gac ate ate ggc ctg tct ggc ctg ate acg ccg teg 2448 
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Ala Glu Lys Cys Asp lie lie Gly Leu Ser Gly Leu He Thr Pro Ser 
805 810 815 

ctg gac gag atg gtc cac gtc gcc aag gaa atg cag egg cag aat ttc 2496 
Leu Asp Glu Met Val His Val Ala Lys Glu Met Gin Arg Gin Asn Phe 
820 825 830 

cag ttg ccg ctg atg ate ggc ggc gcc act acc teg aag gcg cat ace 2544 
Gin Leu Pro Leu Met He Gly Gly Ala Thr Thr Ser Lys Ala His Thr 
835 840 845 

gcg gtg aag ate gat ccg cag tac age aac gac gcg gtg gtc tac gtc 2592 
Ala Val Lys He Asp Pro Gin Tyr Ser Asn Asp Ala Val Val Tyr Val 
850 855 860 

acc gac gcc teg cgc gcg gta ggc gtg gcc acc age ctg ctg tec aag 2640 
Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys 
865 870 875 880 

gag ctg aag gcc gac tac gtg gcc cgc acc cgc gcc gac tac gcg gtg 2688 
Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val 
885 890 895 

gtc cgc gaa cgc acg gcc aac cgc age gcc cgc acc gag egg ctg age 2736 
Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser 
900 90S ~ 910 

tac gaa cag gcg ate gcc aac aag ccg gcg ttc gac tgg gcc ggc tac 2784 
Tyr Glu Gin Ala He Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr 
915 920 925 

cag gcg ccg acg cct tec ttc acc ggc gtc agg gtg etc gac gag ate 2832 
Gin Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu He 
930 935 940 

gac etc gcg gtg etc gcc gag tac ate gac tgg acg ccg ttc ttc att 2880 
Asp Leu Ala Val Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He 
945 950 955 960 

tec tgg gac ctg gcc ggc aag tac ccg cgc ate etc acc gac gag gtg 2928 
Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg He Leu Thr Asp Glu Val 
965 970 975 

gtc ggc gag gcc gcc acc teg ttg ttc aac gac gcc cag gcg atg ctg 2976 
Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gin Ala Met Leu 
980 985 990 

aag aag ctg ate gac gag aag ctg ate aag gcc cgc gcg gtg ttc ggc 3024 
Lys Lys Leu He Asp Glu Lys Leu He Lys Ala Arg Ala Val Phe Gly 
995 1000 1005 

ttc tgg ccg gcc aac cag gtc gag cac gac gac ctg gag gtc tac ggc 3072 
Phe Trp Pro Ala Asn Gin Val Glu His Asp Asp Leu Glu Val Tyr Gly 
1010 1015 1020 

gcc gat ggc gag acc etc gcc acc ctg cac cac ctg egg cag cag acg 3120 
Ala Asp Gly Glu Thr Leu Ala Thr Leu His His Leu Arg Gin Gin Thr 
1025 1030 1035 1040 

ate aag ccg gac ggc aag ccg aac ctg teg ctg gcc gat ttc gtc gcg 3168 
He Lys Pro Asp Gly Lys Pro Asn Leu Ser Leu Ala Asp phe Val Ala 
1045 1050 1055 
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ccg aag gaa age ggc gtg cgc gac tac ate ggc ggc ttc ate acc acc 3216 
Pro Lys Glu Ser Qly Val Arg Asp Tyr lie Qly Qly Phe lie Thr Thr 
1060 1065 1070 

gec ggg ate ggc gee gag gaa gtg gec aag gcg tac gaa gec aag ggc 3264 
Ala Gly lie Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly 
1075 1080 1085 

gac gac tac aac age ate atg gtc aag gcg etc gec gac cgc etc gec 3312 
Asp Asp Tyr Asn Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala 
1090 1095 iioo 

gaa gec tgc gec gag tgg ctg cac gag egg gtg cgc aag gag tac tgg 3360 
Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Trp 
1105 IHO ins 1120 

ggc tac gec cgc gac gaa cac etc gac aac gag gec ttg ate aag gag 3408 
Gly Tyr Ala Arg Asp Glu His Leu Asp Aen Glu Ala Leu He Lys Glu 
H25 H30 H35 

caa tac gtc ggc ate cgc ccg gca ccg ggc tac ccg gee tgc ccc gac 3456 
Gin Tyr Val Gly He Arg Pro Ala Pro Gly Tyr Pro Ala CyB Pro Asp 
1140 H45 H50 

cat acc gag aaa ggc act ctg ttc gaa ctg etc gat ccg cag ggc ctg 3504 
His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gin Gly Leu 
1155 H60 H65 

tec ggc gtc age ctg acc gag cac tac gcg atg ttc ccg gec gcg gcg 3552 
Ser Gly Val Ser Leu Thr Glu Hia Tyr Ala Met Phe Pro Ala Ala Ala 
1170 1175 1180 



gtc age ggt tgg tat ttc gec cac ccg cag gcg cag tac ttc gcg gtc 3600 
Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe Ala Val 
1185 H90 1195 1200 



ggc aag ate gac aag gac cag gtg gaa cgc tac age cag cgc aag ggc 
Gly Lys He Asp Lys Asp Gin Val Glu Arg Tyr Ser Gin Arg Lys Gly 
1205 1210 1215 

cag gaa gee age gtc age gag cgc tgg ctg gcg ccg aac ctt ggc tac 
Gin Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr 
1220 1225 1230 

gat gac tga 
Asp Asp 



3648 



3696 



3705 



<210> 28 
<211> 1234 
<212> PRT 

<213> Pseudomonas aeruginosa 
<400> 28 

Met Ser Ser Pro Leu Thr Asp Arg Ser Ala Arg Leu Gin Ala Leu Gin 
1 5 10 15 



His Ala Leu Arg Glu Arg He Leu He Leu Asp Gly Gly Met Gly Thr 
20 25 30 
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Met He Gin Ser Tyr Lys Leu Qlu Glu Ala Asp Tyr Arg Gly Glu Arg 
35 40 45 

Phe Ala Asp Trp Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Leu 
50 55 60 

Leu Ser Arg Pro Asp Val He Gin Ala He Glu Lys Ala Tyr Leu Asp 
65 70 75 80 

Ala Gly Ala Asp He Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin Val 
85 90 95 

Ser Gin Ala Asp Tyr Gly Met Gin Ser Leu Ala Tyr Glu Leu Asn Val 
100 105 no 

Glu Gly Ala Arg Leu Ala Arg Gin Val Ala Asp Ala Lys Thr Ala Glu 
115 120 125 

Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser 
130 135 140 

Arg Thr Cys Ser He Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn 
145 150 155 160 

Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly 
165 170 175 

Leu He Glu Gly Gly Ala Asp Leu He Leu He Glu Thr He Phe Asp 
180 185 190 

Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu 
195 200 205 

Glu Leu Gly Val Glu Leu Pro He Met He Ser Gly Thr He Thr Asp 
210 215 220 

Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn 
225 230 235 240 

Ser Val Arg His Ala Arg Pro He Ser Val Gly Leu Asn Cys Ala Leu 
245 250 255 

Gly Ala Lys Glu Leu Arg Pro Tyr He Glu Glu Leu Ser Thr Lys Ala 
260 265 270 

Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe 
275 280 285 

Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu 
290 295 300 

Phe Ala Ala Ala Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr 
305 310 315 320 

Thr Pro Ala His He Glu Ala He Ala Lys Ala Val Ala Lys Tyr Pro 
325 330 335 

Pro Arg Ala He Pro Glu He Pro Arg Ala Cys Arg Leu Ser Gly Leu 
340 345 350 

Glu Pro Phe Thr He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu 
355 360 365 
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Arg Thr Asn He Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu 
370 375 380 

Glu'Asn Tyr Ala Glu Ala Leu Glu Val Ala Gin Gin Gin Val Glu Ala 
385 390 395 , 400 

Gly Ala Gin Val He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser 
405 410 415 

Lys Ala Ala Met Val Thr Phe Leu Asn Leu He Ala Ser Glti Pro Asp 
420 425 430 

He Ser Arg Val Pro He Met He Asp Ser Ser Lys Trp Glu Val lie 
435 440 445 

Glu Ala Gly Leu Lys Cys He Gin Gly Lys Gly He Val Asn Ser He 
450 455 460 

Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys 
465 470 475 480 

Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly 
485 490 495 

Gin Ala Asp Thr Gin Ala Arg Lys Glu Glu He Cys Lys Arg Ser Tyr 
500 505 510 ' 

Asp He Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp He He Phe 
515 520 525 

Asp Ala Asn He Phe Ala He Ala Thr Gly He Glu Glu His Asn Asn 
530 535 540 

Tyr Ala Val Asp Phe He Asn Ala Cys Ala Tyr He Arg Asp Asn Leu 
545 550 555 560 

Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
565 570 575 

Arg Gly Asn Asn Pro Val Arg Glu Ala lie His Ser Val Phe Leu Tyr 
580 585 590 

Tyr Ala He Arg Asn Gly Leu Thr Met Gly He Val Asn Ala Gly Gin 
595 600 60S 

Leu Glu lie Tyr Asp Glu He Pro Lys Ala Leu Arg Asp Arg Val Glu 
610 615 620 

Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala 
625 630 635 640 

lie Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp 
645 650 655 

Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu 
660 665 670 

Val Lys Gly He Thr Thr Trp He Val Glu Asp Thr Glu Glu Cys Arg 
675 680 685 

Gin Gin Cys Ala Arg Pro lie Glu Val lie Glu Gly Pro Leu Met Ser 
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690 695 700 

Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His 
725 730 735 

Leu He Pro Phe He Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys 
740 745 750 

Gly Lys lie Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly 
755 760 765 

Lys Asn He Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val 
770 775 780 

Asp Leu Gly Val Met Val Pro Ala Glu Lys He Leu Gin Thr Ala He 
785 790 795 800 

Ala Glu Lys Cys Asp lie lie Gly Leu Ser Gly Leu He Thr Pro Ser 
805 810 815 

Leu Asp Glu Met Val His Val Ala Lys Glu Met Gin Arg Gin Asn Phe 
820 825 830 

Gin Leu Pro Leu Met lie Gly Gly Ala Thr Thr Ser Lys Ala His Thr 
835 840 845 

Ala Val Lys He Asp Pro Gin Tyr Ser Asn Asp Ala Val Val Tyr Val 
850 855 860 

Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys 
865 870 875 880 

Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val 
885 890 895 

Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser 
500 905 9io 

Tyr Glu Gin Ala lie Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr 
915 920 925 

Gin Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu He 
930 935 940 

Asp Leu Ala Val Leu Ala Glu Tyr lie Asp Trp Thr Pro Phe Phe lie 
945 950 955 960 

Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg lie Leu Thr Asp Glu Val 
965 970 975 

Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gin Ala Met Leu 
980 985 990 

Lys Lys Leu lie Asp Glu Lys Leu lie Lys Ala Arg Ala Val Phe Gly 
995 1000 1005 

Phe Trp Pro Ala Asn Gin Val Glu His Asp Asp Leu Glu Val Tyr Gly 
1010 1015 1020 
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Ala Asp Gly Glu Thr Leu Ala Thr Leu His His Leu Arg Gin Gin Thr 
1Q 25 1030 1035 " 1040 

lie Lys Pro Asp Gly Lys Pro Asn Leu Ser Leu Ala Asp Phe Val Ala 
1045 1050 1055 

Pro Lys Glu Ser Gly Val Arg Asp Tyr He Gly Gly Phe He Thr Thr 
10^0 1065 1070 

Ala Gly He Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly 
1075 1080 1085 

Asp Asp Tyr Asn Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala 
1090 1095 iioo 

Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Trp 
H05 mo ins ii 20 

Gly Tyr Ala Arg Asp Glu His Leu Asp Asn Glu Ala Leu He Lys Glu 
1125 H30 n35 

Gin Tyr Val Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp 
1140 H45 use 

His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gin Gly Leu 
H55 H60 H65 

Ser Gly Val Ser Leu Thr Glu His Tyr Ala Met Phe Pro Ala Ala Ala 
H70 H75 H80 

Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe Ala Val 
1185 H90 1195 . 1200 

Gly Lys He Asp Lys Asp Gin Val Glu Arg Tyr Ser Gin Arg Lys Gly 
1205 1210 1215 

Gin Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr 
1220 1225 1230 

Asp Asp 



<210> 29 
<211> 3714 
<212> DNA 

<213> Nitrosomas europeae 

<220> 
<221> CDS 
<222> (1)..(3711) 
<223> RNE01732 

<400> 29 

atg aca atg cat gaa cgt get gat ttg ctg aaa egg ttg ctt gec gag 48 

Met Thr Met His Glu Arg Ala Asp Leu Leu Lys Arg Leu Leu Ala Glu 
1 5 io 15 



cgt ate ctg atg etc gac ggt gee atg ggt acg atg ate cag age tac 96 
Arg lie Leu Met Leu Asp Gly Ala Met Gly Thr Met He Gin Ser Tyr 
20 25 30 
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aaa ctg acc gag teg gat tat egg ggg gaa cgt ttt gec gat ttt ccg 144 
Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro 
35 40 45 

cat gat etc aaa ggc aac aat gat ctg etc tgc ctg acc aga ccg gaa 192 
His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu 
50 55 60 

gtc ate cgc tec att cat cgt get tac etc gaa gee ggg teg gat ate 240 
Val He Arg Ser lie His Arg Ala Tyr Leu Glu Ala Gly Ser Asp He 
65 70 75 80 

ate gag acc aac acg ttc aac teg aat gcg ccg teg atg gcg gac tac 288 
He Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr 
85 90 95 

cac atg cag gat ctg gtg tat gaa ctg aat gtg gcg ggt gcg cgc ctg 336 
His Met Gin Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu 
100 105 110 

gcg tgt gag gaa gcg egg gca atg gaa acg cag caa cct gac egg ccc 384 
Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gin Gin Pro Asp Arg Pro 
H5 120 125 

cgt ttc gtt gee ggt gtg ate ggg cct acc acc aaa acg get tea etc 432 
Arg Phe Val Ala Gly Val lie Gly Pro Thr Thr Lys Thr Ala Ser Leu 
130 135 140 

tea ccg gat gtc aat gat cct gga ttc egg gee att acc ttc gat gat 480 
Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala He Thr Phe Asp Asp 
145 150 155 160 

ctg gtg gaa age tat acc gag teg gtg cgc ggg ctg ate gac gga ggc 528 
Leu Val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu He Asp Gly Gly 
165 170 175 

gcg gat att ctg ctg gtc gaa acc att ttt gac acc ttg aat gee aaa 576 
Ala Asp He Leu Leu Val Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

gee gca ttg ttt gee ate gat cag tat ttc gaa acg cat gga tta cgt 624 
Ala Ala Leu Phe Ala He Asp Gin Tyr Phe Glu Thr His Gly Leu Arg 
195 200 205 

ctg ccg gtg atg ata teg gtc acg att acc gat get teg gga cgt aat 672 
Leu Pro Val Met He Ser Val Thr He Thr Asp Ala Ser Gly Arg Asn 
210 215 220 

ctt tec ggg cag aca ccg gaa get ttc tgg aat teg gta egg cat gca 720 
Leu Ser Gly Gin Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala 
225 230 235 240 

cgt ccg ctt teg gtg gga ate aac tgc gcg ttg ggt gcg gag ttg atg 768 
Arg Pro Leu Ser Val Gly He Asn Cys Ala Leu Gly Ala Glu Leu Met 
245 250 255 



cgc ccc tac gtg gaa gag ttg tec aat gtg get gag gtt ttc acc age 
Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser 
260 265 270 



816 



gee cat ccc aat gee ggc ttg cct aat ccc ttg gcg gaa acc ggt tat 864 
Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly Tyr 
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275 280 285 

gac gaa acg ccg gaa tat acc gcc cgt ctg ate aag gat ttt gcg caa 912 
Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu lie Lye Asp Phe Ala Gin 
290 295 300 



960 



1008 



1056 



1104 



tec ggg ttc gtc aac att gtc ggc ggc tgc tgt ggc act aca ccg aaa 
Ser Gly Phe Val Aen He Val Gly Gly Cys Cys Gly Thr Thr Pro Lye 
305 310 315 320 

cat ate gcg gcc att gca gaa gcg gta egg gac ate cct cc& cgc cca 
His He Ala Ala He Ala Glu Ala Val Arg Asp He Pro Pro Arg Pro 
325 330 335 

ctg ccc gat att cct aaa aaa ctg agg ctt tec ggc etc gag ccg etc 
Leu Pro Asp He Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu 
340 345 350 

aat ate gat gaa cat tec ctg ttc gta aac gtg ggt gaa cgt acc aat 
Asn He Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn 
355 360 365 

gtc acc ggc tec aag gca ttt gcc egg ctg att etc aat ggc ggt tat 1152 
Val Thr Gly Ser Lys Ala Phe Ala Arg Leu He Leu Asn Gly Gly Tyr 
370 375 380 

get gaa ggg ctg gtg ate gcg cgc age cag gtg gag aac ggc £ca caa 1200 
Ala Glu Gly Leu Val He Ala Arg Ser Gin Val Glu Asn Gly Ala Gin 
385 390 395 400 

ate ate gat ate aac atg gat gaa gcg atg ctg gat tea cag aag gcg 1248 
He He Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Gin Lys Ala 
405 410 415 

M fc ! ^ C t ! t ° t9 3at Ct9f Ctc gct 9 CC aaa cca 9 at a tc age egg 1296 
Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Asp He Ser Arg 
«0 425 430 

ctg ccg ate atg ctc gat tec age aaa tgg teg gtg ate gaa gcc gga 1344 
Leu Pro He Met Leu Asp Ser Ser Lys Trp Ser Val He Glu Ala Gly 
435 440 445 

ctg aaa tgt gtc cag ggt aag gcg gtc ate aat tec ate age ctc aag 1392 
Leu Lys Cys Val Gin Gly Lys Ala Val He Asn Ser He Ser Leu Lys 
450 455 460 

gaa ggt gaa gcg gag ttt tta cat cat gcc agg ctg gcg cat cat tat 144 0 
Glu Gly Glu Ala Glu Phe Leu His His La Ar S LeS 111 Sg Sg £r 
465 4 ™ 475 480 

ggg gcc gcg gtg att gtc atg get ttc gac gaa acc ggg cag gcc gat 14 88 
Gly Ala Ala Val He Val Met Ala Phe Asp Glu Thr Gl? Gin Ala Isp 
485 490 495 

acc ttg cag cgc aag gtg gaa ate tgc acg cgt tgt tac cat aca ctg 1536 
Thr Leu Gin Arg Lys Val Glu He Cys Thr Arg Cys Tyr His Thr Leu 
500 505 510 



att gaa cag gcc gat ttc cca ccc gag gat ate att ttc gac ccc aat 
He Glu Gin Ala Asp Phe Pro Pro Glu Asp He He Phe Asp Pro Asn 
515 520 525 



1584 
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att ttt gcc att get acg ggt ate gaa gaa cac agt aac tat gca gtg 1632 
He Phe Ala He Ala Thr Gly He Glu Glu H1b Ser Asn Tyr Ala Val 
530 535 540 

gat ttt ate gag gcg aca cac gtc ate egg caa acg ctg cct tat gcc 1680 
Asp Phe He Glu Ala Thr His Val He Arg Gin Thr Leu Pro Tyr Ala 
545 550 555 560 

aaa gtc age ggg ggt gtt tec aat gtt tec ttc teg ttc egg ggt aac 1728 
Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn 
565 570 575 

gaa ccg ate cgc gaa gcc att cat acc gca ttc ctg tat cac gcg gtc 1776 
Glu Pro He Arg Glu Ala He His Thr Ala Phe Leu Tyr His Ala Val 
580 585 590 

aag gca ggc atg acc atg ggt ate gtc aac gca ggt cag ctt ggg gtt 1824 
Lys Ala Gly Met Thr Met Gly He Val Asn Ala Gly Gin Leu Gly Val 
595 600 605 

tat tec gac att ccg ccc gat ctg ctg gaa cat gtc gag gat gta ctg 1872 
Tyr Ser Asp He Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu 
610 615 620 



ctg aac egg egg cct gat gca acc gaa cgt ctg gtg gag ttt gcg gaa 
Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu 
625 630 635 



1920 



640 



cat ttc aag gga cag aaa aag gag cag ate gaa gat ctg tec tgg cgt 1968 
His Phe Lys Gly Gin Lys Lys Glu Gin He Glu Asp Leu Ser Trp Arg 
645 650 655 

gat gaa ccg gtg egg cag cgc ctg att cat gca ctg gtc agg ggt ate 2016 
Asp Glu Pro Val Arg Gin Arg Leu He His Ala Leu Val Arg Gly He 
660 665 670 

age acc tac ate gtc gag gat acc gag etc gtc egg cag gag ate gac 2064 
Ser Thr Tyr He Val Glu Asp Thr Glu Leu Val Arg Gin Glu He Asp 
675 680 685 

age cag gga ggc aag ccg ate gag gtg ate gaa ggc ccg etc atg gac 2112 
Ser Gin Gly Gly Lys Pro He Glu Val He Glu Gly Pro Leu Met Asp 
690 695 700 

ggc atg aat gta gtg ggg gat ctg ttt ggc gca ggc aag atg ttt ctg 2160 
Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

cca cag gtg gtc aag teg gca egg gtg atg aag cag gcg gtt gcc tat 2208 
Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr 
725 730 735 

ctg ttg ccg tac ate gag gca gag aaa aaa att tec ggc gac age aag 2256 
Leu Leu Pro Tyr He Glu Ala Glu Lys Lys He Ser Gly Asp Ser Lys 
740 745 750 

ccc aag ggc aag gtg gtg ate get acc gtc aaa ggg gat gtg cat gat 2304 
Pro Lys Gly Lys Val Val He Ala Thr Val Lye Gly Asp Val His Asp 
755 760 765 

att ggc aag aat ate gtt tec gtc gtg ttg cag tgt aat aac ttt gaa 23S2 
He Gly Lys Asn He Val Ser Val Val Leu Gin Cys Asn Asn Phe Glu 
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770 775 780 

gtc ate aac atg ggg gtg atg gtc ccc agt gca cag att ctg gaa aca 2400 
Val lie Asn Met Gly Val Met Val Pro Ser Ala Gin He Leu Glu Thr 
785' 790 795 " 800 

gca cgc cgt gaa cag gtc gat atg ate ggt ctg tec ggc ctg ate ace 2448 
Ala Arg Arg Glu Gin Val Asp Met He Gly Leu Ser Gly Leu He Thr 
805 810 815 

cct teg ctg gaa gaa atg gcg cat gtt gee egg gaa atg ga(g cgt gaa 2496 
Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu 
820 825 " 830 

caa ttc acc gtt ccg ctg ctg ate ggt ggc gec ace act teg egg atg 2544 
Gin Phe Thr Val Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Met 
835 840 845 

cat acg gca gtc aaa ate gca ccc cat tac ggt ggg gtg acc gta tgg 2592 
His Thr Ala Val Lys He Ala Pro His Tyr Gly Gly Val Thr Val Trp 
850 855 860 

gtg ccg gat gee age egg gca gtc ggg gtg tgc age aat ctg atg tea 2640 
Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser 
865 870 875 880 

cag gat ctg cgt gat gac tat gtc egg cag gtc aag gee gag bag gag 2688 
Gin Asp Leu Arg Asp Asp Tyr Val Arg Gin Val Lys Ala Glu Gin Glu 
885 890 895 

aag age egg gtg cag cac cgc aac aag aaa ggg cca tec aag etc etc 2736 
Lys Ser Arg Val Gin His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu 
900 905 910 

act ttc gag gaa gee egg gee aac gca etc aag acg gat tgg get cgt 2784 
Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg 
915 920 925 

tat act cca cca get ccg gat ttc ctg ggg ttg cgc acc etc aac aac 2832 
Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn 
930 935 940 

tat ccg ctg gaa aca ctg gtg ccg cac ate gac tgg aca cct ttc ttc 2880 
Tyr Pro Leu Glu Thr Leu Val Pro His He Asp Trp Thr Pro Phe Phe 
9*5 950 955 960 

cag gca tgg gaa ctg cac ggg cgc tat cct gee ate ctg cag gat gaa 2928 
Gin Ala Trp Glu Leu His Gly Arg Tyr Pro Ala He Leu Gin Asp Glu 
965 970 975 



etc gtc ggg gaa gca gee age aat ctg ttt cgc gat gee cag aat atg 

Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gin Asn Met 
980 985 990 

etc aga aaa ate gtc gag caa aaa tgg etc acc gee aac gee gtt ate 

Leu Arg Lys He Val Glu Gin Lys Trp Leu Thr Ala Asn Ala Val He 
995 1000 1005 



2976 



3024 



ggc ctg ttc ccg gee aat acc gtc aat gga gat gat ate gag att tat 3072 
Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp He Glu He Tyr 
1010 1015 1020 
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get gac cgt agt cgc agt cag gtg ate atg ace tgg cac acc ttg egg 3120 
Ala Asp Arg Ser Arg Ser Gin Val He Met Thr Trp His Thr Leu Arg 
1025 10 30 1035 1040 

cag cag acg gee aaa ccg gca ggg cgt ccc aat ctg gca ctg get gat 3168 
Gin Gin Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Asp 
1045 1050 1055 

ttc att gcg ccg cgt gaa acc gga ctg gac gat acc ate ggt ttg ttt 3216 
Phe He Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr He Gly Leu Phe 
1060 10 65 1070 

gec gtc age gec ggt ttc ggt ate gat gaa cgc ata cgc get ttt gaa 3264 
Ala Val Ser Ala Gly Phe Gly He Asp Glu Arg He Arg Ala Phe Glu 
1075 1080 1085 

get gca aac gat gat tac agt gee ate ate ctg aaa gca ctg get gat 3312 
Ala Ala Asn Asp Asp Tyr Ser Ala He He Leu Lys Ala Leu Ala Asp 
1090 1095 HOO 

cgt ctg get gaa gcg ttt gca gaa cac atg cat gca egg gtg egg cga 3360 
Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg 
1105 mo ins " 1120 

gaa ttc tgg ggc tat gtg aaa gat gag agt ctg gac aat gaa cag ttg 3408 
Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gin Leu 
1125 H30 H35 



ate gac gag caa tac ctg gga ate cgt cca gca cca ggt tat cct gee 
He Asp Glu Gin Tyr Leu Gly He Arg Pro Ala Pro Gly Tyr Pro Ala 
1140 H45 H50 



gaa aaa cgc age gga ate gtc ata acg gaa tea ttt gec atg gtg ccg 

Glu Lys Arg Ser Gly He Val He Thr Glu Ser Phe Ala Met Val Pro 
1170 H75 H80 

act gca gca gta tec ggc ttc tat etc tct tac cct gaa tec age tat 

Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr 
1185 H90 H95 1200 

ttt get gtt gga aaa ate gga aaa gat cag gtc gag gat tat gca aga 

Phe Ala Val Gly Lys He Gly Lys Asp Gin Val Glu Asp Tyr Ala Arg 
1205 1210 1215 

cgc aaa ggg tgg acg ctg gaa gaa gca gaa agg tgg ctt gcg cct gtc 

Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val 
1220 1225 1230 

ttg gcg tat gag cgt taa 
Leu Ala Tyr Glu Arg 
1235 



3456 



tgc cct gat cat acc gaa aag ggg cca ttg ttc get ctg ctg gaa gcg 3504 
Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala 
1155 iiso H65 



3552 



3600 



3648 



3696 



3714 



<210> 30 
<211> 1237 
<212> PRT 

<213> Nitrosomas europeae 
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<400> 30 

Met Thr Met His Glu Arg Ala Asp Leu Leu Lye Arg Leu Leu Ala Glu 
15 10 15 

Arg. He Leu Met Leu Asp Gly Ala Met Gly Thr Met He Gin Ser Tyr 
20 25 30 

Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro 
35 40 45 

His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu 
50 55 60 

Val He Arg Ser He His Arg Ala Tyr Leu Glu Ala Gly Ser Asp He 
65 70 75 80 

He Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr 
85 90 95 

Hie Met Gin Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu 
100 105 no 

Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gin Gin Pro Asp Arg Pro 
115 120 125 

Arg Phe Val Ala Gly Val He Gly Pro Thr Thr Lys Thr Ala Ser Leu 
I 30 135 140 

Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala He Thr Phe Asp Asp 
145 150 155 160 

Leu val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu He Asp Gly Gly 
165 170 175 

Ala Asp He Leu Leu Val Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 iso 

Ala Ala Leu Phe Ala He Asp Gin Tyr Phe Glu Thr His Gly Leu Arg 
!*5 2 oo 205 

Leu Pro val Met He Ser Val Thr He Thr Asp Ala Ser Gly Arg Asn 
21° 215 220 

Leu Ser Gly Gin Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala 
225 230 235 240 

Arg Pro Leu Ser Val Gly He Asn Cys Ala Leu Gly Ala Glu Leu Met 
245 250 255 

Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser 
260 265 270 

Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly TVr 
275 280 285 

Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu He Lys Asp Phe Ala Gin 
290 295 3 0 o 

Ser Gly Phe Val Asn He Val Gly Gly cys Cys Gly Thr Thr Pro Lys 
305 310 315 320 

His He Ala Ala He Ala Glu Ala Val Arg Asp He Pro Pro Arg Pro 
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325 330 335 

Leu Pro Asp He Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu 
340 345 350 

Asn He Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn 
355 360 365 

Val Thr Gly Ser Lys Ala Phe Ala Arg Leu He Leu Asn Gly Gly Tyr 
370 375 380 

Ala Glu Gly Leu Val He Ala Arg Ser Gin Val Glu Asn Gly Ala Gin 
385 390 395 400 

He He Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Gin Lys Ala 
405 410 415 

Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Asp He Ser Arg 
420 425 430 

Leu Pro He Met Leu Asp Ser Ser Lys Trp Ser Val He Glu Ala Gly 
435 440 445 

Leu Lys Cys Val Gin Gly Lys Ala Val lie Asn Ser He Ser Leu Lys 
450 455 460 

Glu Gly Glu Ala Glu Phe Leu His His Ala Arg Leu Ala Arg Arg Tyr 
465 470 475 480 

Gly Ala Ala Val He Val Met Ala Phe Asp Glu Thr Gly Gin Ala Asp 
485 490 495 

Thr Leu Gin Arg Lys Val Glu He Cys Thr Arg Cys Tyr His Thr Leu 
500 505 sio 

He Glu Gin Ala Asp Phe Pro Pro Glu Asp He He Phe Asp Pro Asn 
515 520 525 

He Phe Ala lie Ala Thr Gly He Glu Glu His Ser Asn Tyr Ala Val 
530 535 54Q 

Asp Phe He Glu Ala Thr His Val He Arg Gin Thr Leu Pro Tyr Ala 
545 550 555 560 

Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn 
565 570 575 

Glu Pro He Arg Glu Ala He His Thr Ala Phe Leu Tyr His Ala Val 
580 585 590 

Lys Ala Gly Met Thr Met Gly He Val Asn Ala Gly Gin Leu Gly Val 
595 600 605 

Tyr Ser Asp He Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu 
610 615 620 

Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu 
625 630 635 640 

His Phe Lys Gly Gin Lys Lys Glu Gin He Glu Asp Leu Ser Trp Arg 
645 650 655 
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Asp Glu Pro Val Arg Gin Arg Leu He His Ala Leu Val Arg Gly He 
660 665 670 

Ser Thr Tyr He Val Glu Asp Thr Glu Leu Val Arg Gin Glu He Asp 
675 680 685 

Ser Gin Gly Gly Lys Pro He Glu Val He Glu Gly Pro Leu Met Asp 
690 695 700 

Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr 
725 730 735 

Leu Leu Pro Tyr He Glu Ala Glu Lys Lys He Ser Gly Asp Ser Lys 
740 745 750 

Pro Lys Gly Lys Val Val He Ala Thr Val Lys Gly Asp Val His Asp 
755 760 765 

He Gly Lys Asn He Val Ser Val Val Leu Gin Cys Asn Asn Phe Glu 
770 775 780 

Val He Asn Met Gly Val Met Val Pro Ser Ala Gin He Leu Glu Thr 
785 790 795 800 

Ala Arg Arg Glu Gin Val Asp Met He Gly Leu Ser Gly Leu He Thr 
805 810 815 

Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu 
820 825 830 

Gin Phe Thr Val Pro Leu Leu lie Gly Gly Ala Thr Thr Ser Arg Met 
835 840 845 

His Thr Ala Val Lys He Ala Pro His Tyr Gly Gly Val Thr Val Trp 
850 855 860 

Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser 
865 870 875 880 

Gin Asp Leu Arg Asp Asp Tyr Val Arg Gin Val Lys Ala Glu Gin Glu 
885 890 895 

Lys Ser Arg Val Gin His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu 
900 905 910 

Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg 
915 920 925 

Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn 
930 935 940 

Tyr Pro Leu Glu Thr Leu Val Pro His He Asp Trp Thr Pro Phe Phe 
945 950 955 960 

Gin Ala Trp Glu Leu His Gly Arg Tyr Pro Ala He Leu Gin Asp Glu 
965 970 975 

Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gin Asn Met 
980 985 990 



{ 
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Leu Arg Lys He Val Glu Gin Lys Trp Leu Thr Ala Asn Ala Val He 
995 1000 1005 

Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp He Glu He Tyr 
1010 1015 1020 

Ala Asp Arg Ser Arg Ser Gin Val He Met Thr Trp His Thr Leu Arg 
1° 25 1030 1035 1040 

Gin Gin Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Asp 
1045 1050 1055 

Phe He Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr He Gly Leu Phe 
1060 1065 1070 

Ala Val Ser Ala Gly Phe Gly He Asp Glu Arg He Arg Ala Phe Glu 
1075 1080 1085 

Ala Ala Asn Asp Asp Tyr Ser Ala He He Leu Lys Ala Leu Ala Asp 
1090 1095 iioo 

Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg 
11° 5 HI© 1115 ~ H20 

Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gin Leu 
H25 H30 

He Asp Glu Gin Tyr Leu Gly He Arg Pro Ala Pro Gly Tyr Pro Ala 
H40 H45 H50 

Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala 
1155 U60 H65 

Glu Lys Arg Ser Gly He Val He Thr Glu Ser Phe Ala Met Val Pro 
H70 1175 H80 

Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr 
ll 85 H90 1195 1200 

Phe Ala Val Gly Lys He Gly Lys Asp Gin Val Glu Ab P Tyr Ala Arg 
1205 1210 1215 

Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val 
1220 1225 1230 

Leu Ala Tyr Glu Arg 
1235 



<210> 31 
<211> 3774 
<212> DNA 

<213> Bordetella pertussis 



<220> 
<221> CDS 
<222> (1) (3771) 
<223> RBP00104 



<220> 

<221> unsure 
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<222> 205 205 

<223> All occurrences of n indicate any nucleotide 
<220> 

<221> unsure 
<222> 277 277 

<223> All occurrences of n indicate any nucleotide 
<400> 31 

gtg cct tat ccc cgt ate ccc ttc ccg ctg tec gec tac acq cat ggc 46 
Val Pro Tyr Pro Arg He Pro Phe Pro Leu Ser Ala Tyr Thr His Qly 
1 5 10 15 

ggc gag ttc gtc cgc caa ctg gac aag cgc ate ctg ate ctg gat ggt 96 
Gly Glu Phe Val Arg Gin Leu Asp Lys Arg He Leu He Leu Asp Gly 
20 25 30 

gec atg ggc acg atg ate cag cgc tac aag ctg ggc gag gec gat ttc 144 
Ala Met Gly Thr Met He Gin Arg Tyr Lys Leu Gly Glu Ala Asp Phe 
35 40 45 

cgt ggc gag cgc ttc gec gag cac cac aag gat etc aag ggc gac aac 192 
Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn 
50 55 60 

gaa ctg ctg teg ntg gtg cgc ccg gac gtg ate gcg gaa ate cac egg 240 
Glu Leu Leu Ser Xaa Val Arg Pro Asp Val He Ala Glu He His Arg 
65 70 75 80 

cag tac etc gag gec ggc gec gac gtg ate gag ace nac acc ttc ggc 288 
Gin Tyr Leu Glu Ala Gly Ala Asp Val He Glu Thr Xaa Thr Phe Gly 
85 90 95 

gec acg teg ate gee cag ggc gat tac gac ctg ccg gag ctg gec tac 336 
Ala Thr Ser lie Ala Gin Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr 
1Q 0 105 no 

gag atg aac ctg gag teg gec cgc ctg gcg cgc gee gee tgc gac gee 384 
Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala 



115 120 



125 



tac age acg ccc gag cat ccg cgc ttc gtg gee ggg gcg ctg ggg ccg 432 
Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro 
130 135 140 

cag ccc aag acc gcg tec ate teg ccc gac gtc aac gac ccg ggg gcg 480 
Gin Pro Lys Thr Ala Ser lie Ser Pro Asp Val Asn Asp Pro Gly Ala 
145 150 155 * 160 

cgc aac gtc acc ttc gac gag ctg cgc gcg gec tat gtc gag cag etc 528 
Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gin Leu 
165 170 175 

aat ggc ctg etc gac ggc ggc ate gac ate gtc ctg ate gaa acc ate 576 
Asn Gly Leu Leu Asp Gly Gly He Asp He Val Leu He Glu Thr He 
180 185 190 

ttc gat acg etc aac gec aag gcg gec ate ttc gec gtc gag gaa gcg 624 
Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Glu Glu Ala 
195 200 205 

ttc gag gcg cgc ggc gtg cgc ctg ccg gtg atg att teg ggc acc gtg 672 
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Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met He Ser Gly Thr Val 
210 215 220 

acc gat gcg teg ggc cgc ate ctg tec ggc cag ace gtc gag gcg ttc 720 
Thr Asp Ala Ser Gly Arg He Leu Ser Gly Gin Thr Val Glu Ala Phe 
225 230 235 240 

tgg aac teg gtg cgc cat gcg egg ccg gtc acc ate ggc ctg aac tgc 768 
Trp Asn Ser Val Arg His Ala Arg Pro Val Thr He Gly Leu Asn Cys 
245 250 255 

gcg ctg ggc gcg gcg ctg atg cgt ccg tat gtg gee gag ctg tec aag 816 
Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lye 
260 265 270 

ate tgc gac acc tat gtg tgc gtc tat ccc aac gee ggc ctg ccc aat 864 
He Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn 
275 280 205 

ccc atg gec gag acg ggc ttt gac gaa acg ccg gee gat acc teg gee 912 
Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala 
290 295 300 

ctg ctg gaa gag ttc gec cag gee ggg ctg gtc aac atg gec ggc ggc 960 
Leu Leu Glu Glu Phe Ala Gin Ala Gly Leu Val Asn Met Ala Gly Gly 
305 310 315 320 

tgt tgc ggc acc acg ccc gag cac ate cgc gec ate gec ggc aag gtg 1008 
Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ala Gly Lys Val 
325 330 335 

gee gcg ctg acg ccg cgc gcg gtg ccc gag gtg ccg gtc aag acc cgc 1056 
Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg 
340 345 350 

ctg teg ggc ctg gag gcg etc aac ate gac gac gag act ctg ttc gtc 1104 
Leu Ser Gly Leu Glu Ala Leu Asn He Asp Asp Glu Thr Leu Phe Val 
355 360 365 

aac gtg ggc gag cgc acc aac gtg acg ggc age aag atg ttc gee cgc 1152 
Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lys Met Phe Ala Arg 
370 375 . 380 

ctg gtc cgc gag gag aaa tac gac gag gcg ctg gee gtg gcg cgc cag 1200 
Leu Val Arg Glu Glu LyB Tyr Asp Glu Ala Leu Ala Val Ala Arg Gin 
385 390 395 400 

cag gtc gag aac ggg gee cag ate ate gac gtc aac atg gac gag gcg 1248 
Gin Val Glu Asn Gly Ala Gin He He Asp Val Asn Met Asp Glu Ala 
405 410 415 

atg ctg gac teg gtg gec tgt atg cac cgc ttc etc aac ctg ate gcg 1296 
Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu He Ala 
420 425 430 

tec gag ccc gac ate gcg egg gtg ccg gtg atg ate gac agt tec aag 1344 
Ser Glu Pro Asp He Ala Arg Val Pro Val Met He Asp Ser Ser Lys 
435 440 445 

tgg gaa gtg ate gag acc ggc ctg aag tgc gtg cag ggc aag gee gtg 1392 
Trp Glu Val He Glu Thr Gly Leu Lys Cys Val Gin Gly Lys Ala Val 
450 455 460 
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gtc aac teg ate tec atg aag gaa ggc gag gag ccg ttc cgc cat cat 1440 
Val Asn Ser lie Ser Met Lys Glu Gly Glu Glu Pro Phe Arg His His 
465 470 475 480 

gcg cgc ctg tgc cgc cgc tac ggc gcg gee atg gtg gtc atg gec ttc 1488 
Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe 
485 490 495 

gac gaa cag ggg cag gec gac teg ctg gag cgc cgc aag gaa ate tgc 1536 
Asp Glu Gin Gly Gin Ala Asp Ser Leu Glu Arg Arg Lys Glii lie Cys 
500 505 " " 510 

ggc cgc gee tac cgt ate ctg gtc gag gaa gag ggc ttc ccg ccc gag 1584 
Gly Arg Ala Tyr Arg He Leu Val Glu Glu Glu Gly Phe Pro Pro Glu 
515 520 525 

gac ate ate ttc gat ccc aac gtg ttc gcg gtg gee ace ggc ate gac 1632 
Asp He He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Asp 
530 535 540 

gaa cac aat cac tac gee gtc gat ttc ate gaa ggc gcg cgc tgg ate 1680 
Glu His Asn His Tyr Ala Val Asp Phe He Glu Gly Ala Arg Trp He 
545 550 555 560 

cgc gcg aac ctg ccg cat gee cgc att teg ggc ggc ate teg aac gtc 1728 
Arg Ala Asn Leu Pro His Ala Arg He Ser Gly Gly He Ser Asn Val 
565 570 , 575 

age ttc teg ttc cgc ggc aac gag ccg atg cgc gag gcg ate cat ace 1776 
Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala He His Thr 
580 585 590 

gtc ttc ctg tac tac gee ate gag gee ggc ctg acg atg ggc ate gtc 1824 
Val Phe Leu Tyr Tyr Ala He Glu Ala Gly Leu Thr Met Gly He Val 
595 600 605 

aac gcg ggc cag ctg ggc gta tat gec gac ctg gcg ccg cac ctg cgc 1872 
Asn Ala Gly Gin Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg 
610 615 620 

gac ctg gtc gag gac gtc ate ctg gac cgc ccc gag ccg gtg ggc cgc 1920 
Asp Leu Val Glu Asp Val He Leu Asp Arg Pro Glu Pro Val Gly Arg 
625 630 635 640 

age gac teg gec gac gag cgc teg ccc acc gaa egg ctg gtg cag ttt 1968 
Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gin Phe 
645 650 655 

gee gag acc gtc aag ggc teg ggc gcg aag aag gaa gaa gac ctg acc 2016 
Ala Glu Thr Val Lys Gly Ser Gly Ala LyB Lys Glu Glu Asp Leu Thr 
660 665 670 

tgg cgc acc ggc teg gtc gag cag cgc ctg gcg cat gee ctg gtg cac 2064 
Trp Arg Thr Gly Ser Val Glu Gin Arg Leu Ala His Ala Leu Val His 
675 660 6B5 

ggc ate acc acc ttc ate gtc gag gac acc gag gaa gtg cgc cag cag 2112 
Gly He Thr Thr Phe He Val Glu Asp Thr Glu Glu Val Arg Gin Gin 
690 695 700 

gtc gec gcg cgc ggc ggg cgc acc ate gaa gtg ate gaa ggt ccg ctg 2160 
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Val Ala Ala Arg Qly Oly Arg Thr lie Glu Val He Glu Gly Pro Leu 
705 710 715 720 

atg gac ggc atg aac gtg gtc ggc gac ctg ttc ggc gcg ggc aag atg 

Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met 
'25 730 735 



2208 



ttc ctg ccg caa gtg gtg aag teg gcg cgc gtg atg aag cag gcg gtg 2256 
Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val 
740 745 750 

gcg cac ctg att ccc ttc ate gag gag gaa aag cgc cag ate gcg gec 2304 
Ala His Leu He Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala 
? 55 760 765 

gcg ggc ggc gat gtg cgc gec aag ggc aag ate gtg ate gec acc gtc 2352 
Ala Gly Gly Asp Val Arg Ala Lys Gly Lys He Val He Ala Thr Val 
7? 0 775 780 

aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg gtg gtc ttg 2400 
Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser Val Val Leu 
785 "0 79S 800 

cag tgc aat aac ttc gaa gtc gtg aac atg ggc gtg atg gtg ccg tgc 2448 
Gin Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys 
80S eiO 815 

gec cag ate ctg cag aag gee aag gac gag aac gec gac atg ate ggc 2496 
Ala Gin He Leu Gin Lys Ala Lys Asp Glu Asn Ala Asp Met He Gly 
820 825 830 

ctg tec ggc ctg ate acg ccc age etc gaa gag atg gec tac gtg get 2544 
Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala 
835 840 845 

tea gaa atg cag cgc gac ccc tat ttc cgc gag cgc gec atg ccg ctg 2592 
Ser Glu Met Gin Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu 
850 855 860 



2640 



2688 



atg ata ggc ggg gcg acc acc age egg gtc cat acg gcg gtc aag ate 
Met He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He 
865 870 875 8B0 

gcg ccc aac tac gac ggt ccg gtg ate tac gtg ccc gat gee age cgt 
Ala Pro Asn Tyr Asp Gly Pro Val He Tyr Val Pro Asp Ala Ser Arg 
88S 890 895 

teg gtc ggc gtg gcg acc age etc atg tec gac cag gec ccg gee tat 2736 
Ser val Gly Val Ala Thr Ser Leu Met Ser Lp Gin La Pro Sa Tyr 
908 905 sio 

fS ?i 9 2i 9 r Ct9 !? 9 °f 9 989 taC 999 9at 9t9 C9C C9C »9» ca * 9CC 2784 
Leu Ala Glu Leu Ala Gin Glu Tyr Glu Asp Val Arg Arg Cys His Ala 

515 920 925 

aac cgc aag gcg gtg ccg ctg gtg teg ctg gec gag gcg cgc gcg gcg 2832 
Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu 111 Arg Ala 111 
930 935 



940 



cgc ccg cag ate gac tgg tec ggc tac cag ccg ccg cgc ccc aag ttc 
tog Pro Gin He Asp Trp Ser Gly Tyr Gin Pro Pro Arg Pro Lys Phe 
945 950 9 5 5 r ?60 



2880 
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ctg ggc egg cgc gec ttc aag age tac gac ctg gee gag ate gcg cgc 2928 
Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu lie Ala Arg 
9*5 970 975 

tat ate gac tgg ggg ccg ttc ttc cag acg tgg age ctg ttc ggc ccg 2976 
Tyr He Asp Trp Gly Pro Phe Phe Gin Thr Trp Ser Leu Phe Gly Pro 
980 985 990 

ttc ccc gee ate ctg gac gac aag gtg gtg ggc gag cag gcg cgc aag 3024 
Phe Pro Ala He Leu Asp Asp Lys Val Val Gly Glu Gin Ala Arg Lys 
995 1000 1005 

gtc tac gag gaa ggc cag gee atg etc aag cgc ate ate gac ggg cgc 3072 
Val Tyr Glu Glu Gly Gin Ala Met Leu Lys Arg He He Asp Gly Arg 
1010 1015 1020 

tgg ctg ace gee age ggc gtg gtc ggc ttc tat ccg gee aac cgc gtc 3120 
Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val 
102 5 1030 1035 1040 

aat gac gaa gac ate gag gtc tac gcg gac gag acg cgc age gag atg 3168 
Asn Asp Glu Asp He Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met 
1045 1050 1055 

ctg ttc ace tac cgc aac ctg cgc cag cag ggc gtc aag cgc gaa ggc 3216 
Leu Phe Thr Tyr Arg Asn Leu Arg Gin Gin Gly Val Lys Arg Glu Gly 
1060 1065 1070 

gtc age aac aag tgc ctg gee gac tac ate gcg ccg cgc gac age ggc 3264 
Val Ser Asn Lys Cys Leu Ala Asp Tyr He Ala Pro Arg Asp Ser Gly 
1075 1080 1085 

ctg etc gac tac ate ggc atg ttc gee gtg acc gcg ggc ctg ggc ate 3312 
Leu Leu Asp Tyr He Gly Met Phe Ala Val Thr Ala Gly Leu Gly He 
1090 1095 hoc 

gag aag aaa gag gee gag ttc cag gcg gcg ctg gac gac tac tec age 3360 
Glu Lys Lys Glu Ala Glu Phe Gin Ala Ala Leu Asp Asp Tyr Ser Ser 
H05 mo ins H20 

ate atg ctg aag teg ctg gee gac egg ctg gee gag gcg ttc gee gaa 3408 
lie Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1125 1130 H35 

tgc atg cac gcg cgc gtg cgc cgc gac ctg tgg ggc tac gcg gcg gac 3456 
Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp 
1140 1145 H50 

gag gcg ctg tec aac gat gag ctg ate gee gag aag tac age ggc ate 3504 
Glu Ala Leu Ser Asn Asp Glu Leu He Ala Glu Lys Tyr Ser Gly He 
U 55 1160 H65 



egg ccg gcg ccc ggc tat ccg gec tgc ccg gag cac gtg gtc aag acg 3552 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Val Val Lys Thr 
H70 1175 H80 

gac ctg ttc cgc gtg ctg gac gec gee gac gtc gga atg gag ctg acc 3600 
Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu Thr 
1185 H90 1195 1200 

gac age tac gec atg ttc ccg gec tec age gtc teg ggg ttc tat ttc 3648 
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Asp ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe 
1205 1210 1215 

age cac ccc gag teg cag tat ttc aac gtg ggc aac ate ggc gec gac 3696 
Ser His Pro Glu Ser Gin Tyr Phe Asn Val Gly Asn He Gly Ala Asp 
1220 1225 1230 

cag ctg gec gac tac gtg gcg cgc age ggc cgc gee gaa gag gac gtg 3744 
Gin Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val 
1235 1240 1245 

cgc cgc acc ctg gcg ccg aac ctg ggc tag 3774 
Arg Arg Thr Leu Ala Pro Asn Leu Gly 
1250 1255 



<210> 32 
<211> 1257 
<212> PRT 

<213> Bordetella pertussis 

<220> j 
<221> unsure 
<222> 69 69 

<223> All occurrences of Xaa indicate any amino acid 
<220> 

<22l> unsure 
<222> 93 . . 93 

<223> All occurrences of Xaa indicate any amino acid 
<400> 32 

Val Pro Tyr Pro Arg He Pro Phe Pro Leu Ser Ala Tyr Thr His Gly 
1 5 10 15 

Gly Glu Phe Val Arg Gin Leu Asp Lye Arg He Leu He Leu Asp Gly 
20 25 30 

Ala Met Gly Thr Met He Gin Arg Tyr Lys Leu Gly Glu Ala Asp Phe 
35 40 45 

Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn 
50 55 go 

Glu Leu Leu Ser Xaa Val Arg Pro Asp Val He Ala Glu He His Arg 
65 70 75 60 

Gin Tyr Leu Glu Ala Gly Ala Asp Val He Glu Thr Xaa Thr Phe Gly 
85 90 95 

Ala Thr Ser He Ala Gin Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr 
1Q 0 105 no 

Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala 
115 120 125 

Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro 
*30 135 140 

Gin Pro Lys Thr Ala Ser He Ser Pro Asp Val Asn Asp Pro Gly Ala 
145 150 155 160 
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Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gin Leu 
165 170 175 

Asn Gly Leu Leu Asp Gly Gly He Asp He Val Leu lie Glu Thr He 
180 185 iso 

Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Glu Glu Ala 
1*5 200 205 

Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met He Ser Gly Thr Val 
210 215 220 

Thr Asp Ala Ser Gly Arg He Leu Ser Gly Gin Thr Val Glu Ala Phe 
225 230 235 240 

Trp Asn Ser Val Arg His Ala Arg Pro Val Thr He Gly Leu Asn Cys 
245 250 255 

Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lys 
260 265 270 

He Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn 
275 280 285 

Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala 
290 295 300 

Leu Leu Glu Glu Phe Ala Gin Ala Gly Leu Val Asn Met Ala Gly Gly 
305 310 315 320 

Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ala Gly Lys Val 
325 330 335 

Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg 
340 345 350 

Leu Ser Gly Leu Glu Ala Leu Asn He Asp Asp Glu Thr Leu Phe Val 
355 360 365 

Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lye Met Phe Ala Arc 
370 375 380 

Leu Val Arg Glu Glu Lys Tyr Asp Glu Ala Leu Ala Val Ala Arg Gin 
385 390 395 400 

Gin Val Glu Asn Gly Ala Gin He He Asp Val Asn Met Asp Glu Ala 
405 410 415 

Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu He Ala 
420 425 430 

Ser Glu Pro Asp He Ala Arg Val Pro Val Met He Asp Ser Ser Lys 
435 440 445 

Trp Glu Val He Glu Thr Gly Leu Lys Cys Val Gin Gly Lys Ala Val 
450 455 460 

Val Asn Ser He Ser Met Lys Glu Gly Glu Glu Pro Phe Arg His His 
465 470 475 480 

Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe 
485 490 495 
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Asp Glu Gin Gly Gin Ala Asp Ser Leu Glu Arg Arg Lys Glu He Cys 
500 505 510 

Gly Arg Ala Tyr Arg He Leu Val Glu Glu Glu Gly Phe Pro Pro Glu 
515 520 525 

Asp He He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Asp 
530 535 540 

Glu His Asn His Tyr Ala Val Asp Phe He Glu Gly Ala Arg Trp He 
545 550 555 560 

Arg Ala Asn Leu Pro His Ala Arg He Ser Gly Gly He Ser Asn Val 
565 570 575 

Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala He His Thr 
580 585 590 

Val Phe Leu Tyr Tyr Ala He Glu Ala Gly Leu Thr Met Gly He Val 
595 600 60S 

Asn Ala Gly Gin Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg 
610 615 620 

Asp Leu Val Glu Asp Val He Leu Asp Arg Pro Glu Pro Val Gly Arg 
"5 630 635 640 

Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gin Phe 
645 650 655 

Ala Glu Thr Val Lys Gly Ser Gly Ala Lys Lys Glu Glu Asp Leu Thr 
660 665 670 

Trp Arg Thr Gly Ser Val Glu Gin Arg Leu Ala His Ala Leu Val His 
675 660 685 

Gly lie Thr Thr Phe lie Val Glu Asp Thr Glu Glu Val Arg Gin Gin 
690 695 700 

Val Ala Ala Arg Gly Gly Arg Thr He Glu Val He Glu Gly Pro Leu 
705 710 715 720 

Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met 
725 730 735 

Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val 
740 745 750 

Ala His Leu He Pro Phe He Glu Glu Glu Lys Arg Gin lie Ala Ala 
755 760 765 

Ala Gly Gly Asp Val Arg Ala Lys Gly Lys lie Val lie Ala Thr Val 
770 775 780 

Lys Gly Asp Val His Asp lie Gly Lys Asn lie Val Ser Val Val Leu 
785 790 795 800 

Gin Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys 
80S 810 ' 815 

Ala Gin lie Leu Gin Lys Ala Lys Asp Glu Asn Ala Asp Met He Gly 
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820 825 830 

Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala 
835 840 845 

Ser Glu Met Gin Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu 
850 855 860 

Met He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He 
865 870 875 880 

Ala Pro Asn Tyr Asp Gly Pro Val He Tyr Val Pro Asp Ala Ser Arg 
885 890 895 

Ser Val Gly Val Ala Thr Ser Leu Met Ser Asp Gin Ala Pro Ala Tyr 
900 90S 910 

Leu Ala Glu Leu Ala Gin Glu Tyr Glu Asp Val Arg Arg Cys His Ala 
915 920 925 

Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu Ala Arg Ala Ala 
530 935 940 

Arg Pro Gin He Asp Trp Ser Gly Tyr Gin Pro Pro Arg Pro Lys Phe 
945 950 955 ~ 960 

Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu He Ala Arg 
965 970 975 

Tyr He ABp Trp Gly Pro Phe Phe Gin Thr Trp Ser Leu Phe Gly Pro 
980 985 990 

Phe Pro Ala He Leu Asp Asp Lys Val Val Gly Glu Gin Ala Arg Lys 
995 1000 1005 

Val Tyr Glu Glu Gly Gin Ala Met Leu Lys Arg He lie Asp Gly Arg 
1010 1015 1020 

Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val 
1025 1030 1035 1040 

Asn Asp Glu Asp He Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met 
1045 1050 1055 

Leu Phe Thr Tyr Arg Asn Leu Arg Gin Gin Gly Val Lys Arg Glu Gly 
1060 1065 1070 

Val Ser Asn Lys Cys Leu Ala Asp Tyr He Ala Pro Arg Asp Ser Gly 
1075 1080 1085 

Leu Leu Asp Tyr He Gly Met Phe Ala Val Thr Ala Gly Leu Gly He 
1090 1095 iioo 

Glu Lys Lys Glu Ala Glu Phe Gin Ala Ala Leu Asp Asp Tyr Ser Ser 
1105 mo nig U20 

He Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
H25 H30 H35 

Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp 
11^0 1145 H50 
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Glu Ala Leu Ser Asn Asp Glu Leu He Ala Glu Lys Tyr Ser Gly He 
1155 H60 H65 

Arg Pro Ala Pro Gly Tyr Pro Ala eye Pro Glu Hie Val Val Lye Thr 
H™ 1175 U80 

Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu Thr 
1185 U95 1200 

Asp Ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe 
1205 1210 1215 

Ser His Pro Glu Ser Gin Tyr Phe Asn Val Gly Asn He Gly Ala Asp 
122 0 1225 1230 

Gin Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val 
1235 1240 1245 

Arg Arg Thr Leu Ala Pro Asn Leu Gly 
1250 1255 

<210> 33 
<211> 3645 
<212> DNA 

<213> Chlorobium tepidum 

<220> 

<221> CDS 

<222> (1) (3642) 

<223> RCL00420 

<400> 33 

gtg etc gac ggg gec atg ggc acc atg ate cag agg cat ggc etc gac 48 

Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
1 5 io 15 

gaa cag gac tac egg ggc gag cgt ttc get teg cat gac cat ccg ctg 96 
Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

aag ggc aac aac gac ctt ctt gtc ate acc egg ccc gac ate ate cgt 144 
Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He He Arg 
35 40 45 

teg ate cac tgc gac ttc etc gac gcg ggt gcg gac ate ate gag acc 192 
Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp He He Glu Thr 
50 55 go 

tgc acc ttc aac gec aac ccg ate teg cag teg gac tac cag ttg cag 240 
Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gec cgc teg 288 
Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg 336 
Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 no 



gee ggt tec ate gga ccg acc aac aag acg etc teg etc teg ccg gac 



3B4 
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Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

gtg aac aac ccc ggc ttc cgc gcc gtc acc ttc cag gag atg gtc gat 432 
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

aac tac act gcc cag etc gaa ggc ttg cac gag ggc ggt gtc gat etc 480 
Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

ttg etc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg etc 528 
Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

tac get ate gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg 576 
Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 190 

atg gtc tec ggc acg gtg gtg gac gcg age ggc cgc acc etc tec ggc 624 
Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

caa acc acc gag gcg ttc tgg att teg att teg cac atg ccg agt ctg 672 
Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Leu 
210 215 220 

etc teg gtc ggc ctg aac tgc gca etc ggc tec aag cag atg cgc ccc 720 
Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

ttc ate gag gcg etc teg aac ate gcc gaa age tac gtc age gtc tat 768 
Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac gac gac tec ccc 816 
Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 * 270 

gag tac atg gcc gcg cag ate gcg ggc ttc gcc gaa tea ggc ttc gtg 864 
Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

aac ate gtc ggc ggc tgc tgc ggc acc acg ccg acg cac ate cgc gcc 912 
Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 



att gcc gaa gcg gtc aag act etc ccg ccg aga aag cgc ccc gcc aac 
He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 320 

aag cac gtg ctg agg etc tec ggc etc gaa ccg etc gtg gtt gac gaa 
Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

acc acc ggc ttc ate aac gtc ggc gag cgc acc aac gtc acc ggt teg 
Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 

cgc aag ttc gcc cgc etc ate aag gag gcc aat tac gac gaa gcg etc 
Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 



960 



1008 



1056 



1104 
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tec att gec cgc cag cag gtc gag aac ggc gcg cag gtg ate gac gtg 1152 
Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

III t°o. C f aC H? 3 3t9 CtC 9ac tcc 9 aa aa 9 ate gtc gaa ttc 1200 
Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val He Val Glu Phe 

385 3 *° 395 400 

ctg aac etc ate gee tcc gag cct gag ate gee aag gtg ccg gtg atg 1248 
Leu Asn Leu lie Ala Ser Glu Pro Glu lie Ala Lys Val Pro Val Me? 

405 410 415 

ate gac teg teg aaa tgg teg gtc ate gaa aac ggc ctg cgc tgc aec 1296 
lie Asp ser Ser Lys Trp Ser Val lie Glu Asn Gly Leu Arg Cys Thr 
42 <> 425 430 

cag ggc aag age ate gtc aac teg ate age etc aag gag ggc gag gag 1344 
Gin Gly Lys Ser He Val Asn Ser lie Ser Leu LyI llu Gly 5lu 111 
435 440 445 

ctg ttc aag gag cgc get cgc aag ate atg caa tac ggc gcg gcg gcg 1392 
Leu Phe Lys Glu Arg Ala Arg Lys lie Met Gin Tyr Gly LI LI LI 
450 455 4 6 o 

gtg gtc atg gee ttc gac gag cag ggc cag gee gac age ctg cac cgc 1440 
Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp sir Lei His 2g 
465 470 475 480 

cgc ate gag att tgc age cgc gec tac aaa att etc ace gaa gag gtg 1488 
Arg lie Glu lie Cys Ser Arg Ala Tyr Lye lie Leu Thr Glu 111 Va? 

4 °5 490 495 

ggc ttc ccg ccg gag gac ate ate ttt gac ccg aac gtg ctg aec gtg 153 6 
Gly Phe Pro Pro Glu Asp lie lie Phe Asp Pro Asn Va? Lei Tnr Va? 
500 505 510 

La" Thr ml S! !!° n? 9 aaC taC 9C9 CtC 9aC ttC atC 9 aa "84 

Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 

515 520 525 

age gtg cgc tgg ate aag cag aac ctg ccg cac gcg aag gtc tcc aac 1632 
Ser Val Arg Trp He Lys Gin Asn Leu Pro His LI LyI Val Ser ?ly 
530 535 ■» 



540 



ggc ate age aac gtt teg ttc tcc ttc cgc ggc aac gag ccg gtg cgc 
Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn 111 PrI Val Arg 
545 550 --- 3 



1680 



555 560 



gag gcg atg cac acc gcg ttc etc tac cac gee ate cac gee ggt etc 1728 
Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His La lly £u 
565 570 



575 

gac atg ggc ate gtc aac gec gee cag ctt ggc ate tac gaa aaa ate i77fi 
Asp Met Gly He Val Asn Ala Ala Gin Leu G?y He Tyr Glu III ill 
580 585 590 

III d C9 n? 9 , C " ?" 9tC tat 9tc 939 9ac 9*3 cta ct 9 aac cgc cgc 1824 
Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 60S 

gac gac gec acc gag egg etc gtg gcg ttc get gaa acg ate cgc gac 1872 
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Asp Asp Ala Thr Qlu Arg Leu Val Ala Phe Ala Qlu Thr He Arg Asp 
610 615 620 

ggc ggc gaa aag gcc gag gcc aag aac gcc gaa tgg cgc aac gcc ccg 1920 
Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
625 «0 635 " 640 

gtc gag gag egg ctg aaa cac gcg etc gtc aag ggc ate gtt gac tac 1968 
Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

ate gac gag gac ace gaa gag gcc cgc cag etc tac ccg agt ccg ctg 2016 
He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

gag gtg ate gag ggg ccg etc atg aac ggc atg aac cac gtc ggc gac 2064 
Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

etc ttc gcc gaa ggc aag atg ttc ctg cca cag gtg gtc aaa age gcc 2112 
Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

cgc gtc atg aag cgc teg gta get gcg ctg att ccc tat ate gag gag 2160 
Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

gag aag teg aaa aac tgc gac acg age gcc aaa gcc aag gtg ctg etc 2208 
Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 * 735 

gcc acg gtg aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg 2256 
Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

gtg gtg ctt gcc tgc aac aac ttc gac gtg ate gac ate ggc gtc atg 2304 
Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
7 55 760 765 

atg cca tgc gac aag att etc gaa gcg ctg gca gaa cac aag ccc gac 2352 
Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asd 
770 775 780 

gtg etc ggc etc tec ggc etc ate ace ccg teg etc gaa gag atg gcg 24 00 
Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 79S 0oo 

cac gtg gcc aaa gag atg gag egg etc ggc atg aac att ccg etc ate 2448 
His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu lie 
805 810 815 

ate ggc ggc gcg acc acc teg aag gtg cac acg gcg gtg aaa etc gcg 2496 
He Gly Gly Ala Thr Thr Ser Lys Val Hi B Thr Ala Val Lys Leu Ala 
820 825 830 

ccc tgc tac ccc age ggc gcg gta gta cac gtg etc gac gcc teg cgc 2544 
Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

age gtg ccg gtg gtc age aac etc tgc aac ccc gcc cag cgc gac age 2592 
Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 
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tat ate gcg gcg ctg aag gat gag cag gag gcg atg cgc aag age cac 2640 
Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
865 870 875 880 



gec gag cgc atg gcg gca aaa aag tac gtc teg etc gac gee gee cgc 
Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 



2688 



gac aac cgc etc acc att gac tgg gag gee gaa ace ate gac aag ccc 
Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 90S 910 



2736 



gee cag act ggc gtc acc gtg ctg gag gat gtc acc gtc ggc gcg etc 
Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 



2784 



cgc ccg tat ate gac tgg gca mcc ttc ttc tgg age tgg gag ctg cac 
Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 



2832 



ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca 
Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 



2880 



acc aaa etc ttc aac gac gec acc get ctg etc gac egg ate gac age 
Thr LyB Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 975 



2928 



gaa aag ctg etc ggc ate aaa ggc gtg gcg ggc ate ttc ccg gee aac 
Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 



2976 



age ate ggc gac gac ate ttc gtc tat gcg gat gac gag cgc teg ata 3024 
Ser He Gly Asp Asp He Phe Val Tyr Ala Asp Asp Glu Arg Ser He 
995 1000 1005 

ate cgc acc gtg ctg cac acc ctg cgc cag caa ggc gaa aag cac ggc 3072 
He Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

gaa gcg aac etc gcg ctg gcg gac ttc gtg gee ccg cgc gaa age ggc 3120 
Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
102 * 1030 1035 ~ 1040 

gtc aac gac tgg ate ggc tgc ttc acc gta acc gee gga etc ggc ate 3168 
Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly He 
1045 1050 1055 

cag aat ttg etc gac gag ttc aca gca gag aac gac gac tac cac cgc 3216 
Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 1065 * 1070 

ate atg aca cag gcg etc gec gac cga ctg gee gaa gcg ttc gca gag 3264 
He Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

atg ctg cac gaa aag gtg cgc cgc gaa etc tgg ggc tac gcg ccc ggc 3312 
Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 1100 



gaa ate etc ggc aac gaa gag ctg ate gec gaa aag tac cga ggc ate 3360 
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Glu lie Leu Gly Asn Glu Glu Leu He Ala Glu Lye Tyr Arg Gly He 
"OS 1110 ins i 120 

cgc ccc gcc ccc ggc tac ccc gcc tgc ccg gat cac acc gaa aag gca 3408 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 
1125 H30 U3S 

ate ate ttc gac ctg etc aac get gaa gcg gcc acc ggc gtc acg ctg 3456 
He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 H45 xiso 

acg gaa act ttc gcg atg aac ccc gca gcc tea gtc tgc ggc etc tac 3504 
Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 
1155 H60 H65 

ttc gcc aac ccg gcc teg aaa tac ttc gta etc ggc aag att ggt aag 3552 
Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lvb He Gly Lve 
1170 1175 HBO 

gat cag gtc gaa gac tac gcc aac cgc aaa ggg ctg gaa gta gca gaa 3600 
Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
1185 H90 1195 1200 



gcc gag aag tgg etc gcg ccc teg ctg aac tac gat cca gcg 
Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 



3642 



taa 



3645 



<210> 34 
<211> 1214 
<212> PRT 

<213> Chlorobium tepidura 



<220> 

<221> unsure 
<222> 936 .. 936 

<223> All occurrences of Xaa indicate any amino acid 
<400> 34 

Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
IS 10 



15 



Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He lie Arg 
35 40 45 

Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp lie lie Glu Thr 
50 55 60 

Cys Thr Phe Asn Ala Asn Pro lie Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys lie Ala Arg Ser 
85 90 95 

Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 no 
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Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

Leu Leu Val Glu Thr Val Phe Asp Thr, Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 190 

Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Leu 
210 215 220 

Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

Glu Tyr Met Ala Ala Gin lie Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 

He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 ~ 320 

Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 

Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

Ser lie Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val lie Asp Val 
370 375 380 

Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val lie Val Glu Phe 
385 390 395 400 

Leu Asn Leu lie Ala Ser Glu Pro Glu lie Ala Lys Val Pro Val Met 
405 410 415 

lie Asp Ser Ser Lys Trp Ser Val lie Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

Gin Gly Lys Ser lie Val Asn Ser lie Ser Leu Lys Glu Gly Glu Glu 
435 440 445 
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Leu Phe Lys Glu Arg Ala Arg Lys lie Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu His Arg 
465 470 475 480 

Arg lie Glu He Cys Ser Arg Ala Tyr Lys lie Leu Thr Glu Glu Val 
485 490 495 

Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn Val Leu Thr Val 
500 505 510 

Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

Asp Met Gly He Val Asn Ala Ala Gin Leu Gly He Tyr Glu Glu He 
580 585 590 

Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 60S 

Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
610 615 620 

Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
625 630 635 640 

Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

Val Val Leu Ala Cys Asn Asn Phe Ab P Val He Asp He Gly Val Met 
755 760 765 

Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asp 
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770 775 780 

Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
78 5 790 795 800 

His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

He Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 
820 825 83? 

Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 

Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
865 870 875 ~ 880 

Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 905 910 

Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 

Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 

Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 ~ 975 

Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 

Ser lie Gly Asp Asp lie Phe Val Tyr Ala Asp Asp Glu Arg Ser He 
995 1000 1005 

He Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
102S 1030 1035 " 1040 

Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly He 
1045 1050 1055 

Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 1065 1070 

He Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 iioo 
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Glu lie Leu Gly Asn Glu Glu Leu He Ala Glu Lys Tyr Arg Gly He 
HOS 1110 1115 1120 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 
H25 H30 

He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 H45 H50 

Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu TVr 
H55 H60 H65 

Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys He Gly Lys 
H70 H75 liao 

Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
1185 H90 U95 1200 

Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 

<210> 35 
<211> 3777 
<212> DNA 

<213> Deinococcus radiodurans 

<220> 
<221> CDS 
<222> (1) (3774) 
<223> RDR02645 

<400> 35 

atg age cat cac cca gaa gcg teg get tec gcc aat ccg tec ate aac 48 

Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser He Asn 
15 io 15 

cat caa ccg tec acc ate acc gag gcc gcc cgc cag cgc ate ctg att 96 
His Gin Pro Ser Thr He Thr Glu Ala Ala Arg Gin Arg He Leu He 
20 25 30 

etc gac ggc gcc tgg ggt acg cag ctt cag cga gcc aac etc acc gaa 144 
Leu Asp Gly Ala Trp Gly Thr Gin Leu Gin Arg Ala Asn Leu Thr Glu 
35 40 45 

gcg gac ttc cgc tgg gac gaa gcc gac ccc acg egg atg tac egg ggc 192 
Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Gly 
50 55 60 



240 



288 



aac ttc gac ctg ctg caa ctg acc aag cct gac gtg att cgc gcc gtg 
Asn Phe Asp Leu Leu Gin Leu Thr Lys Pro Asp Val He Arg Ala Val 
65 70 75 bo 

cac cgc gcc tat ttc gag gcc gga gcg gac ate gcc age acc aat acc 
His Arg Ala Tyr Phe Glu Ala Gly Ala Asp He Ala Ser Thr Asn Thr 
85 90 95 

ttc aac tec acg acc ate teg cag gcg gat tac ggc acc gag gca ctg 336 
Phe Asn Ser Thr Thr He Ser Gin Ala Asp Tyr Gly Thr Glu Ala Leu 
10° 105 * no 

gcc tac gcc atg aac cgc gag ggg gca agg ctg gcc cgc gaa gtc gcc 384 
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Ala Tyr Ala Met Asn Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala 
115 120 125 

gac gag ttc gag gcg cgc gac ggc aaa aag cgc tgg gtg gcg, ggg agt 432 
Asp Glu Phe Glu Ala Arg Asp Gly Lys Lys Arg Trp Val Ala Gly Ser 
130 135 140 

gtc ggt ccc acc aac cgc acc gcg acc ctt tct ccc gac gtg gag egg 480 
Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg 
145 "0 155 160 

ccc gag ttc cgc aac gtg acc tac gac gac etc gtg gcg gcg tac teg 528 
Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser 
165 170 175 

gag gee ate acc ggg ttg atg gaa ggt ggc gcg gac ctg ctg etc att 576 
Glu Ala lie Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu lie 
180 185 190 

gaa acg gtg ttt gac acg ctg aac gec aaa gec gcg ctg ttt gee gcg 624 
Glu Thr Val Phe A B p Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala 
195 200 205 

cag gac gtg ttc gcg gcg cag ggg cgc gag ctg ccg gtc atg etc teg 672 
Gin Asp Val Phe Ala Ala Gin Gly Arg Glu Leu Pro Val Met Leu Ser 
210 215 220 

ggc acc ate acc gac gec teg ggc cgc acg ctg age ggg cag acg ccc 720 
Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro 
225 230 235 240 

gaa gec ttc gcg gtg age acc gag cac gec ggc etc ttt teg ctg ggc 768 
Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly 
245 250 255 

ctg aac tgc gcg ctg ggc gec gac ctg ctg egg ccc cac ctg cgc gca 816 
Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro Hie Leu Arg Ala 
260 265 ~ 270 

att gcg gcg aac acg gag gcg ctg gtg teg gtt cac ccc aac gcg ggc 864 
He Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly 
275 280 285 

etc ccc aac gee ttc ggg gaa tac gac gaa acg ccc gaa cac acg gcg 912 
Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala 
290 295 300 



gcg gtg ctg gec gac ttc gee cgc gag ggg ctg gtc aac ate gtg ggc 
Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn He Val Gly 
305 310 315 



960 



320 



ggc tgc tgc ggc acc aca ccc gag cac ate aaa gcg att gcg gag gcg 1008 

Gly Cys Cys Gly Thr Thr Pro Glu His He Lys Ala He Ala Glu Ala 
325 330 335 

gtg aag gac att ccc ccg cgc cag gcg ctg caa ctg ccg cct tac ctg 1056 

Val Lys Asp He Pro Pro Arg Gin Ala Leu Gin Leu Pro Pro Tyr Leu 

340 345 350 

cgc etc age ggc etc gaa gee ttc acc ctg acg ccg gaa acc aac ttc 1104 

Arg Leu Ser Gly Leu Glu Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe 
355 360 365 
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gtc aac gtg ggc gag cgc acc aac gtg acc ggc agt ccc aag ttc age 1152 
Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser 
370 375 300 

aag gcg att ctg gec ggc gac tac gac gca ggg etc aag att gec cgc 1200 
Lya Ala lie Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lye lie Ala Arg 
385 390 395 400 

cag cag gtg acg aac ggc gcg caa ate gtg gac ate aac ttc gac gag 1248 
Gin Gin Val Thr Asn Gly Ala Gin He Val Asp He Asn Phe Asp Glu 



405 410 



415 



ggg atg etc gac ggc gaa gga gcg atg gtc aag ttc etc aac ctg etc 1296 
Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu 
420 425 430 

gee ggg gag ccg gac ate teg cgc gtg ccc ctg atg etc gac teg tec 1344 
Ala Gly Glu Pro Asp lie Ser Arg Val Pro Leu Met Leu Asp Ser Ser 
435 440 445 

aag tgg gag att ctg gaa gcg ggg ctg egg egg gtg cag ggc aag gca 1392 
Lys Trp Glu He Leu Glu Ala Gly Leu Arg Arg Val Gin Gly Lys Ala 
450 455 460 

gtc gtc aac tec ate teg etc aag gac ggc gag gee agg ttt ctg gaa 1440 
Val Val Asn Ser He Ser Leu Lys Asp Gly Glu Ala Arg Phe Leu Glu 
465 470 475 480 

cgc gee egg ctg ctg egg cgc tac ggg gcg gcg gcg gtg gtc atg gec 1488 
Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala 
485 490 495 

ttc gac gaa cag gga cag gee gac aac etc gec cga cgc egg gag att 1536 
Phe Asp Glu Gin Gly Gin Ala Asp Asn Leu Ala Arg Arg Arg Glu He 
500 505 si© 

ctg ggc cgc gcg tat agg ctg ctg acc gag cag gcg gac ttt ccg ccg 1584 
Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gin Ala Asp Phe Pro Pro 
515 520 525 

cag gac ate att ttc gac ccc aac gtg ctg acc gtt gec acc ggc ate 1632 
Gin Asp He He Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly He 
530 535 540 

ni 9 Hf 3 «f C ? a ° C9C taC 9C9 Ct9 9ac ttfc atc 9 C 9 acg cgc tgg 1680 
Glu Glu His Asp Arg Tyr Ala Leu Asp Phe He Glu Ala Thr Arg Trp 

545 550 555 * 560 

ti! f 33 H 33 aa ° Ct9 CC9 909 9C9 aag gtg tcg " c 999 att tc * aac 1728 
He Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly He Ser Asn 



565 570 



575 



gtc tcg ttc age ttc egg ggc aac aac cac gtg cgc gag gcg atg cac 1776 

Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His 
580 585 590 

?i 9 ?? Ctg taC ° aC gC ° atC Cgc gcc 999 ct 9 gac atg ggc atc 1824 

Ala Val Phe Leu Tyr His Ala He Arg Ala Gly Leu Asp Met Gly He 
555 600 605 

gtg aac gcg ggg atg ctg gcg gtg tac gag gac atc gag ccg gag ctg 1872 
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Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp He Glu Pro Glu Leu 
610 615 620 



cgc gag gcc gtc gag gac gtc att ctg get cgc cgt ccg gac, gec acc 
Arg Glu Ala Val Glu Asp Val He Leu Ala Arg Arg Pro Asp Ala Thr 
625 *30 635 , 640 



1920 



gag cgt ttg ctg acg ctg gcc gac cgc tac aag gac ate aag cgc gaa 1968 
Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp He Lys Arg Glu 

645 650 ( 655 ( 

agt gcc gcc cag age gcc tgg cgc gac ctg ccg gtg cag gaa egg ctg 2016 
Ser Ala Ala Gin Ser Ala Trp Arg Asp Leu Pro Val Gin Glu Arg Leu 
660 665 670 

egg cac gca ctg gtg cag ggc gtc gcc gac cac gtg gat gag gac gcc 2064 
Arg His Ala Leu Val Gin Gly Val Ala Asp His Val Asp Glu Asp Ala 
675 680 685 

gag gcc gcc tat cag gaa etc ggc age ccg ctg gcc gtc ate gaa ggc 2112 
Glu Ala Ala Tyr Gin Glu Leu Gly Ser Pro Leu Ala Val He Glu Gly 
690 695 700 

ccg ctg atg gac ggc atg aac gtg gtg ggc gac etc ttc ggc gcg ggg 2160 
Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly 
705 710 715 720 

aaa atg ttc ctg ccg cag gtg gtc aaa tec gcc cgc gtg atg, aaa aag 2208 
Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Lys 
725 730 735 

fi 3 2*? !?° ta ° ° tC a ° 9 CCC tat Ctg 9aa 9 a 9 aa 9 9 C 9 9 a « age 2256 
Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser 

74 ° 745 750 

tec age aag ggc aag gta ctg ctg gcg acc gtc aag ggc gat gtg cac 2304 
Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His 
7 *5 760 765 

gac ate ggc aag aac ate gtg ggc gtg gtg etc gcc tgc aac ggc tat 2352 
Asp He Gly Lys Asn He Val Gly Val Val Leu Ala Cys Asn Gly Tyr 
770 775 780 

cag gtg acc gac etc ggc gtg atg gtg ccg ggc gag aag att ctg gac 2400 
Gin Val Thr Asp Leu Gly Val Met Val Pro Gly Glu Lys He Leu Asp 
785 790 795 800 

gaa gcc gag egg etc ggt gcc gac gtg ate ggt ctg age ggg ctg att 2448 
Glu Ala Glu Arg Leu Gly Ala Asp Val He Gly Leu Ser Gly Leu lie 
805 eiO 815 

acg cct tec tta gac gaa atg gtg aac gtg gcc cgc gag atg acg cgc 2496 
Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg 
820 825 830 

egg ggc gtg aaa act cca ctg ctg ate ggc ggc gcg acg acc age egg 2544 
Arg Gly Val Lys Thr Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg 
835 840 845 

gcg cac acg gcg gtc aag att gac ccg gcc tac gac ggg acg gta gtg 2592 
Ala His Thr Ala Val Lys lie Asp Pro Ala Tyr Asp Gly Thr Val Val 
850 855 860 
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cac gtg ctg gac gcc age cgc gcc gtg acc gtg acc aac gac ctg ctg 2640 
His Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu 
865 870 875 880 

acc gac gag gcc gcc tac get ggg cgc gtg cag ggc gag tat gac acc 2688 
Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gin Gly Glu Tyr Asp Thr 
885 890 895 

ttg cgc gag cgc cac ggc gag egg cag gtg egg ctg att gcg ctg gca 2736 
Leu Arg Glu Arg His Gly Glu Arg Gin Val Arg Leu He Ala Leu Ala 
900 905 910 

gaa gcc cgc gcc cgc gcc ccg caa ctg agt gcc gcc gtg ccc ccc gcg 2784 
Glu Ala Arg Ala Arg Ala Pro Gin Leu Ser Ala Ala Val Pro Pro Ala 
915 920 925 

ccg cac gat ctg ggc cgt cag gtg gtc gaa cag ccc att gcc gag ctg 2832 
Pro His Asp Leu Gly Arg Gin Val Val Glu Gin Pro He Ala Glu Leu 
930 935 940 

ctg ccc ttc ate gac tgg acg ccc ttt ttc ate gcc tgg gag atg aag 2880 
Leu Pro Phe He Asp Trp Thr Pro Phe Phe He Ala Trp Glu Met Lye 
945 950 955 960 

ggc ate tac ccg ggc ate ctg acc gac cct ctg cgt ggc gag gag gcc 2928 
Gly He Tyr Pro Gly He Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala 
965 970 975 

cgc aag ctg ttt gcc gac gcg cag gcg ctg ctg gag cag gtt ate gcc 2976 
Arg Lys Leu Phe Ala Asp Ala Gin Ala Leu Leu Glu Gin Val He Ala 
980 985 990 

gac ggc teg ctg egg gcg cgc ggc gtc ate ggg ctg tgg ccc gcg cac 3024 
Asp Gly Ser Leu Arg Ala Arg Gly Val He Gly Leu Trp Pro Ala His 
995 1000 1005 

ggc gac gac ate gtg ctg gac gat gcg gcg atg ggg cgt ggc gag acg 3072 
Gly Asp Asp He Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr 
10*0 1015 1020 

ctg gat ttc gag acg cac gaa etc gcc gcc ggg cgc gag ccg ctg ccg 3120 
Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro 
1025 1030 1035 1040 

aac atg ccg cgc ctg cac acg ctg egg cag cag cgc gac cag acc acg 3168 
Asn Met Pro Arg Leu His Thr Leu Arg Gin Gin Arg Asp Gin Thr Thr 
1045 1050 " 1055 

ccg aac act gcg ctg get gac ttt gtg gcg gaa gga ggc gac cac ate 3216 
Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp His He 
1°60 1065 1070 

ggc gcc ttc gcc acg gcc ate ttc ggc gcc gag gag ttg gcg cag cag 3264 
Gly Ala Phe Ala Thr Ala He Phe Gly Ala Glu Glu Leu Ala Gin Gin 
1075 1080 1085 

ttc gag gcg cag cac gac gac tac aac teg att ctg gtc aag gcg gtg 3312 
Phe Glu Ala Gin His Asp Asp Tyr Asn Ser He Leu Val Lys Ala Val 
1090 1095 lioo 

gcc gac cga ctg gcc gag gcc ttt gcc gag aag ctg cac cgc gac gtg 3360 
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Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lys Leu HIb Arg Asp Val 
1105 mo U15 1120 

cgc gtg egg cac tgg ggt tac gec gag ggc gag gcg etc gac, aac acc 3408 
Arg Val Arg Hie Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr 
ll 25 1130 lias 

gac etc ate aag gag cgc tat cag ggc ate cgc cct gcg ccc ggc tac 3456 
Asp Leu He Lys Glu Arg Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1140 H45 1150 Y y 

ccc gcg cag ccc gac cac acc gag aaa cgc acc ctg ttt gag ctg ctg 3504 
Pro Ala Gin Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu 
II 55 1160 ii 6 5 

gac gcg gaa age ate ggc ctg cgc etc acc gag teg tgt gec atg acc 3552 
Asp Ala Glu Ser He Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr 
II 70 1175 118O 

ccg gcg gcg gcg gtg teg ggg ctg tac ttc gcg cat ccg gag gec cgt 3600 
Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg 
I 185 1190 1195 1200 

tat ttc gca gtg ggc cgc ate ggg cgc gac cag gtg gag aac tac gec 3648 
Tyr Phe Ala Val Gly Arg He Gly Arg Asp Gin Val Glu Asn Tyr Ala 
1205 1210 1215 

gec cgt aag ggt tgg act gtg cag gaa gec gag cgc tgg ctg ggg ccg 3696 
Ala Arg Lys Gly Trp Thr Val Gin Glu Ala Glu Arg Trp Leu Gl? Pro 
1220 1225 1230 

ctg ctg gcg tac age gec ggg ccg ggg cca gaa gca age cag aaa gee 3744 
Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gin Lys Ala 
1235 1240 1245 

etc ggc gca gag ctg aca gga gcg caa teg tga 377 7 
Leu Gly Ala Glu Leu Thr Gly Ala Gin Ser 
1250 1255 



<210> 36 
<211> 1258 
<212> PRT 

<213> Deinococcus radiodurans 
<400> 36 

Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser He Asn 
1 5 10 15 

His Gin Pro Ser Thr He Thr Glu Ala Ala Arg Gin Arg He Leu lie 
20 25 30 

Leu Asp Gly Ala Trp Gly Thr Gin Leu Gin Arg Ala Asn Leu Thr Glu 
35 40 45 

Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Glv 
50 55 60 

Asn Phe Asp Leu Leu Gin Leu Thr Lys Pro Asp Val He Arg Ala Val 
65 70 75 80 

His Arg Ala Tyr Phe Glu Ala Gly Ala Asp He Ala Ser Thr Asn Thr 
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85 90 95 

Phe Aim Ser Thr Thr lie Ser Gin Ala Asp Tyr Gly Thr Glu Ala Leu 
100 105 no 

Ala Tyr Ala Met Aen Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala 
115 120 125 

Asp Glu Phe Glu Ala Arg Asp Gly Lys Lye Arg Trp Val Ala Gly Ser 
130 135 140 

Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg 
"5 150 155 160 

Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser 
165 170 175 

Glu Ala He Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu lie 
180 185 " 190 

Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala 
195 200 205 

Gin Asp Val Phe Ala Ala Gin Gly Arg Glu Leu Pro Val Met Leu Ser 
210 215 220 

Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro 
225 230 235 240 

Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly 
245 250 255 

Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro His Leu Arg Ala 
260 265 270 

He Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly 
275 280 285 

Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala 
290 295 300 

Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn He Val Gly 
305 310 3i5 32 J 

Gly Cys Cys Gly Thr Thr Pro Glu His He Lys Ala He Ala Glu Ala 
325 330 335 

Val Lys Asp lie Pro Pro Arg Gin Ala Leu Gin Leu Pro Pro Tyr Leu 
340 345 350 

Arg Leu Ser Gly Leu Glu Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe 
355 360 365 

Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser 
370 375 380 

Lys Ala He Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lys He Ala Arg 
385 390 395 400 

Gin Gin Val Thr Asn Gly Ala Gin He Val Asp He Asn Phe Asp Glu 
405 410 415 
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Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu 
420 425 430 

Ala Gly Glu Pro Asp He Ser Arg Val Pro Leu Met Leu Asp ,Ser Ser 
435 440 445 

Lys Trp Glu He Leu Glu Ala Gly Leu Arg Arg Val Gin Gly Lys Ala 
450 455 460 

Val Val Asn Ser He Ser Leu Lys Asp Gly Glu Ala Arg Phe Leu Glu 
465 470 475 " 400 

Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala 
485 490 495 

Phe Abp Glu Gin Gly Gin Ala Asp Asn Leu Ala Arg Arg Arg Glu He 
500 505 510 

Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gin Ala Asp Phe Pro Pro 
515 520 525 

Gin Asp He He Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly He 
530 535 540 

Glu Glu His Asp Arg Tyr Ala Leu Asp Phe He Glu Ala Thr Arg Trp 
545 550 555 560 

He Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly He Ser Asn 
565 570 * 575 

Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His 
580 585 590 

Ala Val Phe Leu Tyr His Ala He Arg Ala Gly Leu Asp Met Gly He 
595 600 605 

Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp He Glu Pro Glu Leu 
610 615 620 

Arg Glu Ala Val Glu Asp Val He Leu Ala Arg Arg Pro Asp Ala Thr 
625 630 635 640 

Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp He Ly B Arg Glu 
645 650 655 

Ser Ala Ala Gin Ser Ala Trp Arg Asp Leu Pro Val Gin Glu Arg Leu 
660 665 670 

Arg His Ala Leu Val Gin Gly Val Ala Asp His Val Asp Glu Asp Ala 
675 680 685 

Glu Ala Ala Tyr Gin Glu Leu Gly Ser Pro Leu Ala Val He Glu Gly 
690 695 700 

Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly 
705 710 7 i5 720 

Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Lys 
72 5 730 735 

Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser 
740 745 750 
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Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His 
755 760 765 

Asp lie Gly Lys Asn He Val Gly Val Val Leu Ala Cys Asn Gly Tyr 
770 775 780 

Gin Val Thr Asp Leu Gly Val Met Val Pro Gly Qlu Lys He Leu Asp 
785 790 795 800 

Glu Ala Glu Arg Leu Gly Ala Asp Val He Gly Leu Ser Gly Leu He 
805 810 815 

Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg 
820 825 830 

Arg Gly Val Lys Thr Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg 
835 840 845 

Ala His Thr Ala Val Lys He Asp Pro Ala Tyr Asp Gly Thr Val Val 
850 855 860 

His Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu 
865 870 875 880 

Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gin Gly Glu Tyr Asp Thr 
885 890 895 

Leu Arg Glu Arg His Gly Glu Arg Gin Val Arg Leu He Ala Leu Ala 
900 90S 910 

Glu Ala Arg Ala Arg Ala Pro Gin Leu Ser Ala Ala Val Pro Pro Ala 
915 920 925 

Pro His Asp Leu Gly Arg Gin Val Val Glu Gin Pro He Ala Glu Leu 
930 935 940 

Leu Pro Phe He Asp Trp Thr Pro Phe Phe He Ala Trp Glu Met Lys 
945 950 955 960 

Gly lie Tyr Pro Gly He Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala 
965 970 975 

Arg Lys Leu Phe Ala Asp Ala Gin Ala Leu Leu Glu Gin Val He Ala 
980 985 990 

Asp Gly Ser Leu Arg Ala Arg Gly Val He Gly Leu Trp Pro Ala His 
995 1000 1005 

Gly Asp Asp lie Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr 
1010 1015 1020 

Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro 
1025 1030 1035 1040 

Asn Met Pro Arg Leu His Thr Leu Arg Gin Gin Arg Asp Gin Thr Thr 
1045 1050 1055 

Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp His He 
1060 1065 1070 

Gly Ala Phe Ala Thr Ala He Phe Gly Ala Glu Glu Leu Ala Gin Gin 
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1075 1080 1085 

Phe Glu Ala Gin His Asp Asp Tyr Asn Ser He Leu Val Lys Ala Val 
1090 1095 HOO 

Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lys Leu His Arg Asp Val 
1105 IHO Ills U20 

Arg Val Arg His Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr 
ll " 1130 ,1135 

Asp Leu He Lys Glu Arg Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1140 1145 1150 

Pro Ala Gin Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu 
H55 H60 H65 

Asp Ala Glu Ser He Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr 
1170 1175 H80 

Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg 
H85 H90 H95 1200 

Tyr Phe Ala Val Gly Arg He Gly Arg Asp Gin Val Glu Asn Tyr Ala 
1205 1210 1215 

Ala Arg Lys Gly Trp Thr Val Gin Glu Ala Glu Arg Trp Leu dly Pro 
1220 1225 1230, 

Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gin Lys Ala 
1235 1240 1245 

Leu Gly Ala Glu Leu Thr Gly Ala Gin Ser 
1250 1255 



<210> 37 
<2U> 3642 
<212> DNA 

<213> Clostridium acetobutylicum 

<220> 

<221> CDS 

<222> (1) . . (3639) 

<223> RCA01265 



<400> 37 

ctt atg aat tct tea eta aag aat ttg tta aat aac aaa att tta gtt 48 
Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys He Leu Val 

1 5 10 is 

tta gat ggt get atg gga aca tgt att caa tec ttt aat eta gat gaa 96 

Leu Asp Gly Ala Met Gly Thr Cys He Gin Ser Phe Asn Leu Asp Glu 

20 25 30 

ggc gac ttt aaa ggt tec tta tct tgt aca tgt cat tec aat caa aaa 144 

Gly Asp Phe Lye Gly Ser Leu Ser Cys Thr Cys His Ser Asn Gin Lys 

35 40 45 

gga aac aat gat gtt tta aat tta acc aag cca gaa ata ata aaa gaa 192 

Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu He He Lvs Glu 

50 55 60 
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ate cac aag aga tac ctt gaa get ggc gca gat ata ata gaa aca aac 
lie Hie Lys Arg Tyr Leu Glu Ala Gly Ma Asp lie He Glu Thr Asn 
65 ™ 75 80 



240 



act ttt aac get act gaa ata tea caa aaa gat tat aat atg caa gat 288 
Thr Phe Asn Ala Thr Glu He Ser Gin Lys Asp Tyr Asn Met Gin Asp 
85 90 95 

aaa ata tat gat att aat ttt aag ggg gca aaa etc gca aag gaa get 336 
Lys He Tyr Asp He Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala 
100 105 no 

tgt act tac tac aca aaa eta aat cct aat aag cct aga ttt get get 384 
Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala 
115 120 125 

ggt tct att ggg cct aca aat aga act get tct eta tct cca gat gtt 432 
Gly Ser He Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val 
130 135 140 

gaa aat cct ggt ttt aga aat gta acc ttt gat gag eta tgt aat gee 480 
Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala 
145 150 155 i 60 

tat aaa cat caa ata gag get eta ata gat gga ggt gta gac ctt ctt 528 
Tyr Lys His Gin He Glu Ala Leu He Asp Gly Gly Val Asp Leu Leu 
165 170 i7 5 

tta att gaa act ata ttt gat act tta aac get aga gca gca ate ttt 576 
Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Arg Ala Ala He Phe 
I 80 185 190 

gca gca gaa aca gta ttt gaa aat aaa aaa ata aaa ctt cct att ata 624 
Ala Ala Glu Thr Val Phe Glu Asn Lys Lys He Lys Leu Pro He He 
195 200 205 

H? 9 2£ a ata 9Ct 9at aaa agt a 9 a ata tta tec ggt caa 672 
He Ser Gly Thr He Ala Asp Lys Ser Gly Arg He Leu Ser Gly Gin 
21° 215 220 

act ctt gac get ttt gca gaa agt tta aaa aac gaa aat ata att get 720 
Thr Leu Asp Ala Phe Ala Glu Ser Leu Lys Asn Glu Asn He He Ala 
225 2 30 235 240 

ata ggg ctt aat tgt tec ttt ggt get gaa gaa ctt ata cct ttt ata 768 
He Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu He Pro Phe He 
245 250 255 

aaa aga etc tct gaa aca caa aat aga tat ata tec ttt cat cca aac 816 
Lys Arg Leu Ser Glu Thr Gin Asn Arg Tyr He Ser Phe His Pro Asn 
260 265 270 

gca gga ctt cca aac tec ctt ggt gaa tat gaa gaa ctg cca gag gaa 864 
Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu 
275 280 285 

agc 3 " 9t f aaa aaa tta 9 ca ctt 9 aa 99 a ca t tta aat ata 912 
Thr Ala Ser He Val Lys Lys Leu Ala Leu Glu Gly His Leu Asn He 
290 295 300 

gtt gga ggc tgc tgt ggc act aca cca gaa cat ata aga gca ata age 960 
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Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ser 
305 310 315 320 

age gta gtt aaa ggc att tct cca aga aaa gtt cca aac ttg gaa ccc 1008 
Ser val Val Lys Gly He Ser Pro Arg Lys Val Pro Asn Leu Glu Pro 
325 330 335 

aaa aca att tac age gga eta gaa aac ata aaa att gat aag aac agt 1056 
Lys Thr He Tyr Ser Gly Leu Glu Asn He Lys He Asp Lys Asn Ser 
340 345 350 

aac ttc ata aat ata ggc gaa aga aca aat gta gcg ggc tea aga aaa 1104 
Asn Phe He Asn lie Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lvs 
355 360 365 

ttc gca agg ctt ata cgt gaa aaa aat tat gag gag get eta acc att 1152 
Phe Ala Arg Leu lie Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr He 
37 0 375 380 



gca aga cat cag gtt gaa aat ggt gee caa att ata gat ata aat ttt 
Ala Arg Hie Gin Val Glu Asn Gly Ala Gin lie lie Asp He Asn Phe 
385 390 395 400 



aaa gee ata gta aat tec ata agt ctt aag gtt gga gaa gaa aag ttc 
Lys Ala lie Val Asn Ser lie Ser Leu Lys Val Gly Glu Glu Lys Phe 
450 455 460 



cca cct gaa aat ata ata ttt gat cca aat ata eta tct ata gcg aca 
Pro Pro Glu Asn He lie Phe Asp Pro Asn lie Leu Ser lie Ala Thr 
515 520 525 



aaa tgg ata aaa gag aat eta cca tac get aaa gtc age ggt gga gtt 
Lys Trp lie Lys Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val 
54 5 550 555 560 



1200 



gat gat gca ctt tta gat get cgc tct gaa atg gaa aca ttt tta aga 1248 
Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg 
405 410 415 

ctt att gca agt gaa cct gaa ata tea aaa gtt cca gtt atg ata gac 1296 
Leu lie Ala Ser Glu Pro Glu lie Ser Lys Val Pro Val Met lie Asp 
420 425 430 

tec tct aat ttt gaa gtt tta aaa gtt gga tta aag tct att caa ggt 1344 
Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser lie Gin Gly 
435 440 445 



1392 



att gaa gag gca aaa ttt ata aag aac ttt ggc get. ggc gta gtt gta 1440 
He Glu Glu Ala Lys Phe lie Lys Asn Phe Gly Ala Gly Val Val Val 
465 470 475 480 

atg gec ttt gac gaa gaa ggt caa gca get act tat gaa aga aaa att 1488 
Met Ala Phe Asp Glu Glu Gly Gin Ala Ala Thr Tyr Glu Arg Lys lie 
485 490 495 

gaa ate tgc aag aga get tat act att etc aca gaa aaa gtt gag ttt 1536 
Glu lie Cys Lys Arg Ala Tyr Thr He Leu Thr Glu Lys Val Glu Phe 
500 505 510 



1584 



gga att gaa gaa cat gac aac tat gca gtt aat tac ata aaa get gtt 1632 
Gly He Glu Glu His Asp Asn Tyr Ala Val Asn Tyr He Lys Ala Val 
530 535 540 



1680 
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age aac etc tec ttt tct ttt agg ggt aat gac gca ata aga aga get 1728 
Ser Ann Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala He Arg Arg Ala 
565 570 575 

atg cat tct gtt ttc ctt tac cat gca ata aac get gga atg gat atg 1776 
Met His Ser Val Phe Leu Tyr His Ala He Asn Ala Gly Met Asp Met 
580 585 590 

ggt att gtt aat cca gca atg att gat, tta tat gac gat ata gat aag 1824 
Gly He Val Asn Pro Ala Met He Asp Leu Tyr Asp Asp He Asp Lys 
595 600 60S 

gat ctt etc gaa aag gtt gag aat gtt gta eta aat aaa tea tct aac 1872 
Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn 
610 615 620 



get tct gaa tea tta eta gaa ttt get caa acg tat aaa aag acg act 
Ala Ser Glu Ser Leu Leu Glu Phe Ala Gin Thr Tyr Lys Lys Thr Thr 



625 630 635 



640 



720 



atg aaa aag get gtt gaa tgt ctt ctt ccc tat ata aac gaa gaa aag 
Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr He Asn Glu Glu Lys 
725 730 735 



1920 



gaa acc tta gaa aag cac gag gat gaa tgg cga caa aaa age cca agt 1968 
Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gin Lys Ser Pro Ser 
645 650 655 

gaa agg ttg agt tat get tta gtt aaa gga aat gtt gaa ttt att gaa 2016 
Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe lie Glu 
660 665 670 

gaa gat ata gaa gaa gca aga aaa gag tat aca aat gca ctt gaa att 2064 
Glu Asp He Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu He 
675 680 685 

ata gag gtt cct tta atg aat gga atg aaa aaa gtg ggt aaa ctt ttt 2112 
He Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe 
690 695 700 

gga gag gga aaa atg ttt ctt cct caa gta gta aaa agt get aga gtt 2160 
Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val 
705 710 715 



2208 



tct aaa aat cac aat aaa agt get ggt aag gtt gta ttt gca act gtt 2256 
Ser Lys Asn His Asn Lys Ser Ala Gly Lys Val Val Phe Ala Thr Val 
740 745 750 

aaa ggc gat gtt cat gac ata ggc aaa aat ate gta tct gta gtt ctt 2304 
Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser Val Val Leu 
755 760 765 

tec tgc aac aat ttt gaa gtt ata gat tta gga gta atg gtt ccc cct 2352 
Ser Cys ABn Asn Phe Glu Val He Asp Leu Gly Val Met Val Pro Pro 
770 775 780 

gaa acc ata ctt gaa acg gca aaa cgt gaa aat gca gat ate att get 2400 
Glu Thr He Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp He He Ala 
785 790 795 800 

tta agt ggt tta att aca cct tct ctt aat gaa atg get tat gta get 2448 
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Leu Ser Gly Leu He Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala 
805 810 815 

gaa gaa atg aaa agg ctt aat ttt gat ata cca ctt atg gtg ggt ggt 2496 
Glu Glu Met Lys Arg Leu Asn Phe Asp He Pro Leu Met Val Gly Gly 
820 825 830 

get get acc tea aaa act cac aca get tta aaa eta get acg aaa tat 2544 
Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lys Leu Ala Thr Lys Tyr 
835 840 845 

aaa tat gta gta cac agt act gat get tea gat get gtt acc gta gec 2592 
Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala 
850 855 860 

aaa aat eta atg agt gaa aac aaa ttt act ttc tta gaa aaa tta aat 2640 
Lys Asn Leu Met Ser Glu Asn Lys Phe Thr Phe Leu Glu Lys Leu Asn 
865 870 875 880 

gaa gag tat tct aaa ata aga gag acc ttc tct act aat aag att gaa 2688 
Glu Glu Tyr Ser Lys He Arg Glu Thr Phe Ser Thr Asn Lys He Glu 
885 890 895 

ctt ate tec att caa aac gca aga aaa aac aga ttt act att gac tgg 2736 
Leu He Ser He Gin Asn Ala Arg Lys Asn Arg Phe Thr He Asp Trp 
900 90S 910 

aat aaa act aaa ata act gaa cct aaa ttt gtc ggt ata aaa aaa tta 2784 
Asn Lys Thr Lys He Thr Glu Pro LyB Phe Val Gly He Lys Lys Leu 
915 920 925 

caa get gta cct ata aat gaa tta aga aag tat ata gat tgg act ttc 2832 
Gin Ala Val Pro He Asn Glu Leu Arg Lys Tyr He Asp Trp Thr Phe 
930 935 940 



ttc ttt acg tct tgg gat atg gga atg aat tac ccc aaa ata atg aaa 
Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys He Met Lys 
945 950 955 



2880 



960 



gat cct aaa tac gga get gaa get caa aaa etc ttt aag gat gee aat 2928 
Asp Pro Lys Tyr Gly Ala Glu Ala Gin Lys Leu Phe Lys Asp Ala Asn 
965 970 975 

gaa atg ctt gat tta ttg caa aaa gaa aat tta ate act tgt aat gga 2976 
Glu Met Leu Asp Leu Leu Gin Lys Glu Asn Leu He Thr Cys Asn Gly 
980 985 990 

gtt ttt gga ata ttc cca get aat tct gtt aat gat gat ata gaa ate 3024 
Val Phe Gly He Phe Pro Ala Asn Ser Val Asn Asp Asp He Glu He 
?95 1000 1005 

tac act gat aaa gga act gta acc ata aat act ctt cgt cag cag cag 3072 
Tyr Thr Asp Lys Gly Thr Val Thr He Asn Thr Leu Arg Gin Gin Gin 
1010 1015 1020 

ata ctt aaa gac age gat tat aaa get eta tct gat tat ate get cca 3120 
He Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr He Ala Pro 
1025 1030 1035 1040 

aag ggt att ggc ate aaa gat tat ata ggt ggt ttt att gta act get 3168 
Lys Gly He Gly He Lys Asp Tyr He Gly Gly Phe He Val Thr Ala 
10 *5 1050 1055 
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gga ata ggt gca aag gaa tat tec gat aaa tta aag aaa aaa tgc gac 3216 
Gly lie Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lya Lys Cys Asp 
1°«0 10 65 ' 1070 

gat tat gga get act atg ctt aaa ctt ata tgc gat aga ctt gca gag 3264 
Asp Tyr Gly Ala Thr Met Leu Lys Leu He Cys Asp Arg Leu Ala Glu 
1075 1080 1085 

gee ttt tea gaa ctt ctt cac eta agg gta aga aaa gaa tac tgg gga 3312 
Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lye Glu Tyr Trp Gly 
1090 1095 lioo 



tac tct caa gat gaa aac tta tec tta gaa aaa ctt ctt aaa gga agt 
Tyr Ser Gin Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser 
1105 mo ins 112Q 



gac att aat gaa ata aaa aaa tta tta gat act ctg tta taa 
Asp He Asn Glu He Lys Lys Leu Leu Asp Thr Leu Leu 
1205 1210 



3360 



tac aga ggg ata aaa cca get att gga tat cct tct att ccc gat cac 3408 
Tyr Arg Gly He Lys Pro Ala lie Gly Tyr Pro Ser lie Pro Asp His 
H25 mo 

tct gaa aaa gca aag tta ttt gat tta ctt tta ggt aaa act tea ata 3456 
Ser Glu Lys Ala Lys Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser lie 
H40 1145 iiso 

gga gtg gaa ttg acg gaa agt tat atg atg aat cca act tea agt gta 3504 
Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val 
H55 H60 H65 

tgc ggt ttg tat ttt gca aat gaa cga gca aaa tac ttt aat ata aat 3552 
Cys Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn lie Asn 
1170 H75 11 bo 

aaa ata gga aaa gat caa ctt gag gac tat get gtt cga agt aat aaa 3600 
Lys lie Gly Lys Asp Gin Leu Glu Asp Tyr Ala Val Arg Ser Asn Lys 
1185 11^0 1195 1200 



3642 



<210> 38 
<211> 1213 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 38 

Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys lie Leu Val 
1 5 10 15 

Leu Asp Gly Ala Met Gly Thr CyB lie Gin Ser Phe Asn Leu Asp Glu 



20 



25 



30 



Gly Asp Phe Lys Gly Ser Leu Ser Cys Thr Cys His Ser Asn Gin Lys 
35 40 45 

Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu lie lie Lys Glu 
50 55 60 

lie His Lys Arg Tyr Leu Glu Ala Gly Ala Asp lie lie Glu Thr Asn 
65 70 75 bo 
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Thr Phe Asn Ala Thr Glu He Ser Gin Lys Asp Tyr Asn Met Gin Asp 
85 90 95 

Lys He Tyr Asp He Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala 
100 105 no 

Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala 
115 120 125 

Gly Ser He Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val 
130 135 140 

Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala 
145 150 155 160 

Tyr Lys His Gin lie Glu Ala Leu He Asp Gly Gly Val Asp Leu Leu 
165 170 175 

Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Arg Ala Ala He Phe 
180 185 190 

Ala Ala Glu Thr Val Phe Glu Asn Lys Lys He Lys Leu Pro He He 
195 200 205 

lie Ser Gly Thr lie Ala Asp Lys Ser Gly Arg He Leu Ser Gly Gin 
210 215 220 

Thr Leu Asp Ala Phe Ala Glu Ser Leu Lys Asn Glu Asn lie lie Ala 
225 230 235 240 

lie Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu lie Pro Phe He 
245 250 255 

Lys Arg Leu Ser Glu Thr Gin Asn Arg Tyr He Ser Phe His Pro Asn 
260 265 270 

Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu 
275 280 285 

Thr Ala Ser He Val Lys Lys Leu Ala Leu Glu Gly His Leu Asn lie 
290 295 300 

Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His lie Arg Ala lie Ser 
305 310 315 "* 320 

Ser Val Val Lys Gly lie Ser Pro Arg Lys Val Pro Asn Leu Glu Pro 
325 330 335 

Lys Thr lie Tyr Ser Gly Leu Glu Asn lie Lys lie Asp Lys Asn Ser 
340 345 350 

Asn Phe lie Asn lie Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lys 
355 360 365 

Phe Ala Arg Leu lie Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr lie 
370 375 380 

Ala Arg His Gin Val Glu Asn Gly Ala Gin He lie Asp lie Asn Phe 
385 390 395 400 

Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg 
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405 410 415 

Leu He Ala Ser Glu Pro Glu He Ser Lye Val Pro Val Met He Abo 
420 425 430, 

Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser He Gin Gly 
435 440 445 

Lys Ala lie Val Asn Ser He Ser Leu Lys Val Gly Glu Glu Lys Phe 
450 455 460 

He Glu Glu Ala Lys Phe He Lys Asn Phe Gly Ala Gly Val Val Val 
465 470 475 480 

Met Ala Phe Asp Glu Glu Gly Gin Ala Ala Thr Tyr Glu Arg Lys He 
485 490 495 

Glu He Cys Lys Arg Ala Tyr Thr He Leu Thr Glu Lys Val Glu Phe 
500 505 510 

Pro Pro Glu Asn He He Phe Asp Pro Asn He Leu Ser He Ala Thr 
515 520 525 

Gly He Glu Glu His Asp Asn Tyr Ala Val Asn Tyr He Lys Ala Val 
530 535 540 

Lys Trp He Lys Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val 
545 550 555 560 

Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala He Arg Arg Ala 
565 570 575 

Met His Ser Val Phe Leu Tyr His Ala He Asn Ala Gly Met Asp Met 
580 585 590 

Gly He Val Asn Pro Ala Met He Asp Leu Tyr Asp Asp He Asp Lys 
595 600 605 

Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn 
610 615 620 

Ala Ser Glu Ser Leu Leu Glu Phe Ala Gin Thr Tyr Lys Lys Thr Thr 
625 630 635 6 40 

Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gin Lye Ser Pro Ser 
g 45 650 655 

Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe He Glu 
6 60 665 670 

Glu Asp He Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu He 
6? 5 680 685 

He Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe 

695 700 

Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val 
705 710 715 720 

Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr He Asn Glu Glu Lys 
725 730 735 
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Ser Lye Asn His Asn Lys Ser Ala Gly Lys Val Val Phe Ala Thr Val 
740 745 750 

Lys Gly Asp Val His Asp lie Gly Lys Asn lie Val Ser Val Val Leu 
755 760 765 

Ser Cys Asn Asn Phe Glu Val lie Asp Leu Gly Val Met Val Pro Pro 
770 775 780 

Glu Thr He Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp He He Ala 
785 790 795 800 

Leu Ser Gly Leu He Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala 
805 810 815 

Glu Glu Met Lys Arg Leu Asn Phe Asp He Pro Leu Met Val Gly Gly 
820 B25 830 

Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lye Leu Ala Thr Lys Tyr 
835 840 845 

Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala 
850 855 860 

Lys Asn Leu Met Ser Glu Asn LyB Phe Thr Phe Leu Glu Lys Leu Asn 
865 870 875 880 

Glu Glu Tyr Ser Lys He Arg Glu Thr Phe Ser Thr Asn Lys He Glu 
885 890 895 

Leu He Ser He Gin Asn Ala Arg Lys Asn Arg Phe Thr He Asp Trp 
900 90S 910 

Asn Lys Thr Lys He Thr Glu Pro Lys Phe Val Gly He Lys Lys Leu 
915 920 925 

Gin Ala Val Pro He Asn Glu Leu Arg Lys Tyr He Asp Trp Thr Phe 
930 935 940 

Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys He Met Lys 
945 950 955 960 

Asp Pro Lys Tyr Gly Ala Glu Ala Gin Lys Leu Phe Lys Asp Ala Asn 
965 970 975 

Glu Met Leu Asp Leu Leu Gin Lys Glu Asn Leu He Thr Cys Asn Gly 
980 985 990 

Val Phe Gly He Phe Pro Ala Asn Ser Val Asn Asp Asp He Glu He 
995 1000 1005 

Tyr Thr Asp Lys Gly Thr Val Thr He Asn Thr Leu Arg Gin Gin Gin 
1010 1015 1020 

lie Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr He Ala Pro 
1025 1030 1035 1040 

Lys Gly He Gly lie Lys Asp Tyr lie Gly Gly Phe He Val Thr Ala 
1045 1050 1055 

Gly lie Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lys Lys Cys Asp 
1060 1065 1070 
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Asp Tyr Gly Ala Thr Met Leu Lys Leu He Cys Asp Arg Leu Ala Glu 
1075 1080 1085 

Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lys Glu Tyr Trp Gly 
1090 1095 uoo 

Tyr Ser Gin Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser 
^OS U10 1115 H20 

Tyr Arg Gly He Lys Pro Ala He Gly Tyr Pro Ser He Pro Asp His 
1125 H30 H35 

Ser Glu Lys Ala Lys. Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser He 
1140 H45 H50 

Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val 
H55 H60 U65 

Cys Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn He Asn 
1170 1175 H80 

Lys He Gly Lys Asp Gin Leu Glu Asp Tyr Ala Val Arg Ser Asn Lys 
1185 H90 U95 ~ 1200 

Asp lie Asn Glu He Lys Lys Leu Leu Asp Thr Leu Leu 
1205 1210 

<210> 39 
<211> 3954 
<212> DNA 

<213> Caulobacter_crescentus 

<220> 

<221> CDS 

<222> (1) (3951) 

<223> RCO02271 

<400> 39 

atg acc gat etc tec ate cgc gec aac cgc gtc gee gee ctg aag gee 48 

Met Thr Asp Leu Ser He Arg Ala Asn Arg Val Ala Ala Leu Lys Ala 
15 10 is 

gec gee aag gag cgt att etc att etc gac ggc tec tgg ggc gtg atg 96 
Ala Ala Lys Glu Arg He Leu He Leu Asp Gly Ser Trp Gly Val Met 
20 25 30 

ttc cag aag aag ggg ctg acc gag gec gac tac cgc gee gag cgc ttc 144 
Phe Gin Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe 
35 40 45 



gee gec tac aac ggc cag atg aag ggc aat aac gac ate ctg tgc ctg 
Ala Ala Tyr Asn Gly Gin Met Lys Gly Asn Asn Asp He Leu Cys Leu 
5 9 55 60 



192 



acg egg ccc gat etc gtg gee gag ctg cac gac gee tat ttc age gee 240 
Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala 
65 70 75 80 

ggc gec gac ate tec gag acc aac acc ttc teg ggc acc acc ate gee 288 
Gly Ala Asp He Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr He Ala 
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85 90 95 

cag gcc gac tat cat ctg ggt gaa cag gat gtc tgg gac ate aac ctg 336 
Gin Ala Asp Tyr His Leu Gly Glu Gin Asp Val Trp Asp lie Asn Leu 
100 105 no 

gaa ggc gcc aag ate ggc cgc teg gtg gcc gac cgc tgg aac gcg cag 384 
Glu Gly Ala Lys He Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gin 
115 120 125 

aat ccc gac cgc ccg aag ttc ate gcc ggc teg atg ggg ccg ctg aac 432 
Asn Pro Asp Arg Pro Lys Phe He Ala Gly Ser Met Gly Pro Leu Asn 
130 135 140 



gtc atg ctg teg atg teg teg gac gtg aac gat ccg ggc gcg cgc aag 
Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys 
145 150 155 



160 



tgg gcc aag gac ggc etc gtc aac ate ctg ggc ggc tgc tgc ggc acg 
Trp Ala Lys Asp Gly Leu Val Asn He Leu Gly Gly Cye cy B Gly Thr 
305 310 315 



320 



480 



gtg ace ttc gac cag gtc tac gag gcc tat cgc cag cag gtg gat gcg 528 
Val Thr Phe Asp Gin Val Tyr Glu Ala Tyr Arg Gin Gin Val Asp Ala 
165 170 175 

ctt tac cag ggc ggg gtc gat etc ttc ctg ate gag ace ate acc gac 576 
Leu Tyr Gin Gly Gly Val Asp Leu Phe Leu He Glu Thr He Thr Asp 
180 185 190 

acc ctg aac tgc aag gcc gcg ate aag gcg ate ctg gac tgg cgc gac 624 
Thr Leu Asn Cys Lys Ala Ala He Lys Ala He Leu Asp Trp Arg Asp 
195 200 205 

gag ggc cac gag gag ctg ccg ate tgg ate age ggc acc ate acc gat 672 
Glu Gly His Glu Glu Leu Pro He Trp He Ser Gly Thr He Thr Asp 
210 215 220 

cgc teg ggc cgc acc ctg teg ggc cag acg gcc gag gcg ttc tgg aac 720 
Arg Ser Gly Arg Thr Leu Ser Gly Gin Thr Ala Glu Ala Phe Trp Asn 
225 230 235 240 

age gtc aag cac gcc aag ccg ttc gca gtg ggc ttc aac tgc gcc ctg 768 
Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cys Ala Leu 
245 250 255 

ggc gcg gat ttg atg cgt ccg cac ate gcc gag atg gcc cgt ate gcc 816 
Gly Ala Asp Leu Met Arg Pro His He Ala Glu Met Ala Arg He Ala 
260 265 270 

gac acc ctg gtc gca gcc tat ccc aac gcc ggc ctg ccc aac gcc atg 864 
Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met 
275 280 285 

ggc cag tac gac gag gag ccg cac gag acc ggc cac gcc ctg cac gag 912 
Gly Gin Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu 
290 295 300 



960 



aca ccg gac cac ate cgt cac gtc gcc gac gag gtg cgc ggc gtg acg 1008 
Thr Pro Asp His He Arg His Val Ala Asp Glu Val Arg Gly Val Thr 
325 330 ~ 335 
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ccg cgc cag ate ccc gag cgc ccc aag gec atg cgc ctg gcg ggc etc 1056 
Pro Arg Gin lie Pro Glu Arg Pro Lys Ala Met Arg Leu Ala Gly Leu 
340 345 350 

gaa ccg ttc gag ttg get tag tgg eta egg ccg caa att ccc ttc tec 1104 
Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gin lie Pro Phe Ser 
355 360 365 

cct tgc ggg aga agg tgt cgc cga agg cga egg atg agg ggt etc gec 1152 
Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Gly Leu Ala 
370 375 380 

ggc cct tea ace get gtc teg egg egg cga cgt tct tea ace cct cat 1200 
Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His 
385 390 395 400 

ccg acc cgc tgc gcg ggc cac ctt etc ccg caa ggg gag aag gga tga 1248 
Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gin Gly Glu Lys Gly Xaa 
405 410 415 

ctg eta ttg gat cct gaa atg cgc ccc gtc ttc gtc aac ate ggt gag 1296 
Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn lie Gly Glu 
420 425 430 

cgc acc aac gtc acc ggc teg gec aag ttc aag aag ctg ate gtc gaa 1344 
Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu lie Val Glu 
435 440 445 

ggg aac tat ccc gag gcg ctg teg gtc gcg cgc cag cag gtc gag gec 1392 
Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gin Gin Val Glu Ala 
450 455 460 

ggg gee cag gtc ate gac gtg aac atg gac gag ggt ctg ctg gac age 1440 
Gly Ala Gin Val lie Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser 
465 470 475 * 480 

cag cag gec atg gtc acc ttc ctg aat ctg atg gcg gee gag ccc gac 1488 
Gin Gin Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp 
485 490 495 

ate gcg cgc gtg ccg gtg atg ate gac age tec aag tgg gag gtg ate 1536 
He Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu Val He 
500 505 510 

gag gcg ggc ctg aag tgc gta caa ggc aag gcg ate gtc aac teg ate 1584 
Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Ala He Val Asn Ser He 
515 520 525 

age ctg aag gaa ggc gag gaa aag ttc etc gaa cag gee acg etc tgc 1632 
Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gin Ala Thr Leu Cys 
530 535 540 

ctg cgc tat ggc gca gec gtg gtg gtc atg gee ttc gac gag gtt ggc 1680 
Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Val Gly 
545 550 555 560 

cag gec gac acc gaa aag cgc aag gtc gag ate tgt acg egg gec tac 1728 
Gin Ala Asp Thr Glu Lys Arg Lys Val Glu He Cys Thr Arg Ala Tyr 
565 570 575 

aac acg etc gtg gac aag gtc ggc ttc ccg ccc gag gac ate ate ttc 1776 
Asn Thr Leu Val Asp Lys Val Gly Phe Pro Pro Glu Asp He lie Phe 
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580 585 590 

gac ccc aac ate ttc gec gtg gcg acg ggg ate gag gag cac gac aac 1824 
Asp Pro Asn He Phe Ala Val Ala Thr Gly He Glu Glu His Asp Asn 
595 600 605 

tac gec gtc gac ttc ate gag gee acg egg cgc ate aag cag atg ttg 1872 
Tyr Ala Val Asp Phe He Glu Ala Thr Arg Arg He Lys Gin Met Leu 
610 615 620 

ccc tat gcg egg gtg teg ggc ggg gtg teg aac gtc teg ttc age ttc 1920 
Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
625 630 635 640 

egg ggc aat gag ccg gtg cgc egg gcg ate cac teg gtg ttc ctg tac 1968 
Arg Gly Asn Glu Pro Val Arg Arg Ala He His Ser Val Phe Leu Tyr 
645 650 655 

cac gec ate aac gec ggc atg gac atg ggc ate gtc aac gee ggc gac 2016 
His Ala He Asn Ala Gly Met Asp Met Gly He Val Asn Ala Gly Asp 
660 665 670 

ctg ccg gtc tat gac gac ate gat ccg gee ctg cgc gag gee gtc gag 2064 
Leu Pro Val Tyr Asp Asp He Asp Pro Ala Leu Arg Glu Ala Val Glu 
675 680 685 

gac gtg ate etc aac egg ccg cag cgc gat ccg gtg atg acc aac acc 2112 
Asp Val He Leu Asn Arg Pro Gin Arg Asp Pro Val Met Thr Asn Thr 
690 695 700 

gag cgc ctg gtc gag atg gee ccg cgc tat aag ggc gag aag ggg cag 2160 
Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gin 
705 710 715 " 720 

cag cag gtc gee aac ctg gag tgg cga aag ggc acg gtg aac gag cgc 2208 
Gin Gin Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asn Glu Arg 
725 730 735 

ctg acc cat get etc gtt cac ggc ate acc gag ttc ate gag cag gac 2256 
Leu Thr His Ala Leu Val His Gly He Thr Glu Phe He Glu Gin Asp 
740 745 750 

acc gag gag gcg cgc ctg gee gee gag cgc ccc ttg cac gtg att gaa 2304 
Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val He Glu 
755 760 765 

ggc ccg ctg atg gac ggc atg aac gtc gtc ggc gac ctg ttc ggc gcg 2352 
Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala 
770 775 780 

ggc aag atg ttc ctg ccc cag gtg gtg aag teg gee cgc gtg atg aag 2400 
Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys 
785 790 795 800 

cag gee gtc gee tgg ctg atg ccg ttc atg gag gee gag aag gaa ggc 2448 
Gin Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly 
805 810 815 

cag gag cgc aag gec gee ggc aag gtg ctg atg gee acc gtc aag ggc 24 96 
Gin Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly 
820 825 830 
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gac gtc cac gac ate ggt aag aac ate gtc ggc gtc gtg ctg cag tgt 2544 
Asp Val His Asp lie Gly Lys Asn He Val Gly Val Val Leu Gin CyB 
835 840 845 

aac aac tac gag gtc gtg gac ctg ggt gtc atg gtg ccc gee gac cgc 2592 
Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg 
650 655 860 

ate ctg gac gaa gec aag aag cac aag gtc gac atg ate ggc ctg teg 2640 
He Leu Asp Glu Ala Lys Lys His Lys Val Asp Met He Gly Leu Ser 
865 870 875 880 

ggc ctg ate ace ccc teg ctg gac gag atg gtg ttc gtg gee gee gag 2688 
Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu 
885 890 895 

atg gag cgc cag ggc ttt gat ate ccg ctg ctg ate ggc ggc gee acc 2736 
Met Glu Arg Gin Gly Phe Asp He Pro Leu Leu He Gly Gly Ala Thr 
900 905 910 

acc age cgc acc cac acc gcg gtg aag ate gag ccg gee tat cgc egg 2784 
Thr Ser Arg Thr His Thr Ala Val Lys He Glu Pro Ala Tyr Arg Arg 
915 920 925 

ggt ccg acg acc tat gtc gtc gac gec age cgc gee gtg ggc gtg gtc 2832 
Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Gly Val Val 
930 935 940 

teg ggc ctg ctg teg gaa ggc gag cgt gac egg ate ate gec gag acc 2880 
Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg He He Ala Glu Thr 
945 950 955 960 

cgc gee gag tat gtg aag gtc cgc gag caa tac gcg cgc ggc cag acc 2928 
Arg Ala Glu Tyr Val Lys Val Arg Glu Gin Tyr Ala Arg Gly Gin Thr 
965 970 975 

acc aag gee cgc gee teg ate cag gag gee cgc aag cgc gec ttc gee 2976 
Thr Lys Ala Arg Ala Ser He Gin Glu Ala Arg Lys Arg Ala Phe Ala 
980 985 990 

att gac tgg aag ggc tat gcg ccg ccc aag ccc gee ttc ate ggc acg 3024 
He Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe He Gly Thr 
995 1000 1005 

egg gtg ttc gag ccg teg ctg gee gag ctg gtc ccg ttc ate gac tgg 3072 
Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe He Asp Trp 
1010 1015 1020 

teg ccg ttc ttc gee age tgg gag ctg ate ggc cgc ttc ccg cag ate 3120 
Ser Pro Phe Phe Ala Ser Trp Glu Leu He Gly Arg Phe Pro Gin He 
1025 1030 1035 1040 

ctg gag gac gac gtg gtc ggc cag gec gec acc gac etc tac cgc gac 3168 
Leu Glu Asp Asp Val Val Gly Gin Ala Ala Thr Asp Leu Tyr Arg Asp 
1045 1050 1055 

gee cgc gee atg ctg gac aag gtg gtc gag gaa aag tgg ttc ggg gec 3216 
Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala 
1060 1065 1070 

aag ggc gtg ate ggc ttc tgg ccg gec cag gee cag ggc gac gac ate 3264 
Lys Gly Val He Gly Phe Trp Pro Ala Gin Ala Gin Gly Asp Asp He 
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1080 



1085 



gtg etc tat acc gac gag acc cgc gtg gec gag ttc teg cgc ctg cac 
Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His 
1090 1095 1100 



3312 



acc ctt cgc cag cag atg gac aag ggc gec gac aag age ggc gag gec 
Thr Leu Arg Gin Gin Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala 
1105 1110 1115 1120 



3360 



aag gee aat gtc gec ctg teg gac ttc gtc gcg ccg ate ggg cag ggg 
Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro lie Gly Gin Gly 
1125 1130 1135 



3408 



get gac tat gtc ggc ggc ttc gec gtc acc gca ggc cat ggc gag gac 3456 

Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp 
1140 1145 1150 

gag ate gtc gec aag ttc aag gcg gee ggc gac gac tac aac gec ate 3504 

Glu He Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala He 
1155 1160 1165 



atg gee teg gec ctg gee gac cgc ctg gec gaa gee ttc gee gag tgg 
Met Ala Ser Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Trp 
1170 1175 1180 



3552 



ctg cac tac aaa gee cgt gtc gag ctg tgg ggc tac gee gec gac gag 
Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu 
1185 1190 1195 1200 



3600 



gac gee gac gtc gag cgc ctg ate gee gaa aag tac cag ggc ate cgc 
Asp Ala Asp Val Glu Arg Leu He Ala Glu Lys Tyr Gin Gly He Arg 
1205 1210 1215 



3648 



ccc gcg ccc ggc tat ccg gec cag ccc gac cac acc gag aaa ggt acg 
Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Gly Thr 
1220 1225 1230 



3696 



ctg ttc aag ctg etc gac gee gag gcg gee acc ggt ctg cag ctg acc 
Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gin Leu Thr 
1235 1240 1245 



3744 



gag age tac gec atg acc cct ggc gcg gcg gtc tec ggc ctg ttc ttc 
Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu Phe Phe 
1250 1255 1260 



3792 



age cac cgc cag gcg cac tat ttc ggg gtc ggc aag ate gac gec gac 
Ser His Arg Gin Ala His Tyr Phe Gly Val Gly Lys He Asp Ala Asp 
1265 1270 1275 1280 



3840 



cag gtc gag gac tac gee cgc cgc aag ggc tgg gat atg gag acg gee 
Gin Val Glu Asp Tyr Ala Arg Arg Lys Gly Trp Asp Met Glu Thr Ala 
1285 1290 1295 



3888 



gag cgc tgg ctg teg ccg ate ctg aac tac gat ccg eta gcg egg gcg 
Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Asp Pro Leu Ala Arg Ala 
1300 1305 1310 



3936 



cgc ggg gcg gcg get tag 
Arg Gly Ala Ala Ala 
1315 



3954 
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<210> 40 
<211> 1317 

<212> PRT « 
<213> Caulobacter_crescentus 

<220> 

<221> unsure 
<222> 359 ..359 

<223> All occurrences of Xaa indicate any amino acid , 
<220> 

<221> unsure 
<222> 416 . . 416 

<223> All occurrences of Xaa indicate any amino acid 
<400> 40 

Met Thr Asp Leu Ser He Arg Ala Asn Arg Val Ala Ala Leu Lys Ala 
15 10 15 

Ala Ala Lys Glu Arg He Leu He Leu Asp Gly Ser Trp Gly Val Met 
20 25 * 30 

Phe Gin Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe 
35 40 45 

Ala Ala Tyr Asn Gly Gin Met Lys Gly Asn Asn Asp He Leu Cys Leu 
50 55 60 

Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala 
65 70 75 80 

Gly Ala Asp He Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr He Ala 
85 90 95 

Gin Ala Asp Tyr His Leu Gly Glu Gin Asp Val Trp Asp He Asn Leu 
100 105 * 110 

Glu Gly Ala Lys He Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gin 
115 120 125 

Asn Pro Asp Arg Pro Lys Phe He Ala Gly Ser Met Gly Pro Leu Asn 
130 135 140 

Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys 
145 150 155 160 

Val Thr Phe Asp Gin Val Tyr Glu Ala Tyr Arg Gin Gin Val Asp Ala 
165 170 175 

Leu Tyr Gin Gly Gly Val Asp Leu Phe Leu He Glu Thr He Thr Asp 
180 165 190 

Thr Leu Asn Cys Lys Ala Ala He Lys Ala He Leu Asp Trp Arg Asp 
195 200 205 

Glu Gly His Glu Glu Leu Pro He Trp He Ser Gly Thr He Thr Asp 
210 215 220 

Arg Ser Gly Arg Thr Leu Ser Gly Gin Thr Ala Glu Ala Phe Trp Asn 
225 230 235 240 



WO 03/087386 PCT/EP03/04010 

177 

Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cys Ala Leu 
245 250 255 

Gly Ala Asp Leu Met Arg Pro His He Ala Glu Met Ala Arg He Ala 
260 265 270 

Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met 
275 280 285 

Gly Gin Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu 
290 295 300 

Trp Ala Lys Asp Gly Leu Val Asn He Leu Gly Gly Cys Cys Gly Thr 
305 310 315 320 

Thr Pro Asp His He Arg His Val Ala Asp Glu Val Arg Gly Val Thr 
325 330 335 

Pro Arg Gin He Pro Glu Arg Pro Lys Ala Met Arg Leu Ala Gly Leu 
340 345 350 

Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gin He Pro Phe Ser 
355 360 365 

Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Gly Leu Ala 
370 375 380 

Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His 
385 390 395 400 

Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gin Gly Glu Lys Gly Xaa 
405 410 415 

Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn He Gly Glu 
420 425 430 

Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu He Val Glu 
435 440 445 

Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gin Gin Val Glu Ala 
450 455 460 

Gly Ala Gin Val He Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser 
465 470 475 480 

Gin Gin Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp 
485 490 495 

He Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu Val He 
500 505 510 

Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Ala He Val Asn Ser He 
515 520 525 

Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gin Ala Thr Leu Cys 
530 535 540 

Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu val Gly 
545 550 555 560 

Gin Ala Asp Thr Glu Lys Arg Lys Val Glu He Cys Thr Arg Ala Tyr 
565 570 575 
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Asn Thr Leu Val Asp Lye Val Gly phe Pro Pro Glu Asp He He Phe 
580 585 590 

Asp Pro Asn He Phe Ala Val Ala Thr Gly He Glu Glu His Asp Asn 
595 600 605 

Tyr Ala Val Asp Phe He Glu Ala Thr Arg Arg He Lys Oln Met Leu 
610 615 620 

Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
625 630 635 640 

Arg Gly Asn Glu Pro Val Arg Arg Ala He His Ser Val Phe Leu Tyr 
645 650 655 

His Ala He Asn Ala Gly Met Asp Met Gly He Val Asn Ala Gly Asp 
660 665 670 

Leu Pro Val Tyr Asp Asp He Asp Pro Ala Leu Arg Glu Ala Val Glu 
675 680 685 

Asp Val He Leu Asn Arg Pro Gin Arg Asp Pro Val Met Thr Asn Thr 
690 695 70_0 

Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gin 
705 710 715 720 

Gin Gin Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asn Glu Arg 
725 730 735 

Leu Thr His Ala Leu Val His Gly He Thr Glu Phe He Glu Gin Asp 
740 745 750 

Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val He Glu 
755 760 765 

Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala 
770 775 780 

Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys 
785 790 795 BOO 

Gin Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly 
805 810 * 815 

Gin Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly 
820 825 830 

Asp Val His Asp He Gly Lys Asn He Val Gly Val Val Leu Gin Cys 
835 840 845 

Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg 
850 855 860 

He Leu Asp Glu Ala Lys Lys His Lys Val Asp Met He Gly Leu Ser 
865 870 875 880 

Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu 
885 690 895 

Met Glu Arg Gin Gly Phe Asp He Pro Leu Leu He Gly Gly Ala Thr 
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900 



905 



910 



Thr Ser Arg Thr His Thr Ala 
915 



Val Lys lie 
920 



Glu Pro 



Ala Tyr 
925 



Arg 



Arg 



Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Oly Val Val 
930 935 940 

Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg He He Ala Glu Thr 
945 950 955 960 

Arg Ala Glu Tyr Val Lys Val Arg Glu Gin Tyr Ala Arg Gly Gin Thr 
965 970 975 

Thr Lys Ala Arg Ala Ser He Gin Glu Ala Arg Lys Arg Ala Phe Ala 
960 985 990 

He Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe He Gly Thr 
995 1000 1005 

Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe He Asp Trp 
1010 1015 1020 

Ser Pro Phe Phe Ala Ser Trp Glu Leu lie Gly Arg Phe Pro Gin He 
1025 1030 1035 " 1040 

Leu Glu Asp Asp Val Val Gly Gin Ala Ala Thr Asp Leu Tyr Arg Asp 
1045 1050 1055 

Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala 
1060 1065 1070 

Lys Gly Val He Gly Phe Trp Pro Ala Gin Ala Gin Gly Asp Asp He 
1075 1080 1085 

Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His 
1090 1095 1100 

Thr Leu Arg Gin Gin Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala 
1105 1110 ins H20 

Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro He Gly Gin Gly 
1125 1130 1135 

Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp 
1140 1145 1150 

Glu He Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala He 
1155 1160 H65 

Met Ala Ser Ala Leu Ala ABp Arg Leu Ala Glu Ala Phe Ala Glu Trp 
1170 1175 1180 

Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu 
1185 1190 1195 1200 

Asp Ala Asp Val Glu Arg Leu He Ala Glu Lys Tyr Gin Gly He Arg 
1205 1210 1215 

Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Gly Thr 



1220 



1225 



1230 
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Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gin Leu Thr 
1235 1240 1245 

Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu, Phe Phe 
1250 1255 1260 

Ser His Arg Gin Ala His Tyr Phe Gly Val Gly Lys lie Asp Ala Asp 
1265 1270 1275 1280 

Gin Val Glu Asp Tyr Ala Arg Arg Lys Gly Trp Asp Met Glu Thr Ala 
1285 1290 1295 

Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Asp Pro Leu Ala Arg Ala 
1300 1305 1310 

Arg Gly Ala Ala Ala 
1315 



<210> 41 
<211> 3759 
<212> DNA 

<213> Rhodobacter capsulatus 

<220> 
<221> CDS 
<222> (1)..<3756) 
<223> RRC01731 



<400> 41 

atg ctg acc cag acc ctg ccc cga tct gcg gcc ttt gcc gca att gag 48 
Met Leu Thr Gin Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala He Glu 
1 5 10 15 

gcg ctt teg cgc cag egg ate ttg ate ctt gac ggg gcg atg ggc acg 96 
Ala Leu Ser Arg Gin Arg He Leu He Leu Asp Gly Ala Met Gly Thr 
20 25 30 

cag ate cag cag ctt ggc ctg age gag gac gat ttt ctg ggc cac ggc 144 
Gin lie Gin Gin Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly 
35 40 45 

teg ggc tgc gcc tgc cgc cat gcc acc gat cat ccg caa aag ggc aac 192 
Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gin Lys Gly Asn 
50 55 60 

aac gac ctg ctg gtg ctg acc cag ccg caa gcg ate gag gag ate cat 240 
Asn Asp Leu Leu Val Leu Thr Gin Pro Gin Ala He Glu Glu He His 
65 70 75 80 

ttc cgc tat gcg atg gcg ggg gcg gat ate gtc gag acg aac acc ttt 288 
Phe Arg Tyr Ala Met Ala Gly Ala Asp lie Val Glu Thr Asn Thr Phe 
85 90 95 

teg gcc acc acc ate gcg cag gcc gat tac ggg ctg gaa age gcg gtg 336 
Ser Ala Thr Thr lie Ala Gin Ala Asp Tyr Gly Leu Glu Ser Ala Val 
100 105 no 

ttc gac ctg aac gcc gcg ggg gcg egg gtg gcg egg gcg gcg atg gac 384 
Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp 
115 120 125 
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cgc gcc gag gcc acc gac gga egg cgc cgc ttc gtt gcg ggg gcg gtg 432 
Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ma Gly Ala Val 
130 135 140 

ggg ccg acg aac cgc acc gcc teg etc teg ccc gat gtg aac gac ccg 480 
Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro 
145 150 155 160 

ggc ttt cgc gcc gtc acc ttc gac gat ctg cgc acg gcc tat ggc cag 528 
Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gin 
165 170 175 

cag gtg cgc ggt ctg ate gcg ggg ggc gcc gat ate ctg ctg ate gag 576 
Gin Val Arg Gly Leu lie Ala Gly Gly Ala Asp He Leu Leu lie Glu 
180 185 190 

acg ate ttt gac acg ctg aac gcc aag gcg gcg att ttc gcc tgt ttc 624 
Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Cys Phe 
195 200 205 

gaa gcc ttt gcc gaa egg ggc gag egg ctg ccg gtg atg att tec ggc 672 
Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met He Ser Gly 
210 215 220 

acg ate acc gat gcc teg ggg cgc aca ttg teg ggg cag acg ccg acc 720 
Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro Thr 
225 230 235 240 

gcg ttc tgg cat teg gtg get cat gcc egg ccc ttt acc gtg ggg ctg 768 
Ala Phe Trp His Ser Val Ala His Ala Arg Pro Phe Thr Val Gly Leu 
245 250 255 

aac tgc gcg ctg ggc gcc agt gcg atg cgt ccg cat ctg gcg gaa ctg 816 
Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu 
260 265 270 

gcg ggc gtc gcc ccc tgc gcg ate tgc gcc tat ccc aat gcc ggg ctg 864 
Ala Gly Val Ala Pro Cys Ala He Cys Ala Tyr Pro Asn Ala Gly Leu 
275 280 285 

ccc aat gcc ttt ggc caa tat gac gaa acc ccc gac egg acc gcc gcg 912 
Pro Asn Ala Phe Gly Gin Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala 
290 295 300 

cag gtg gcc gaa ttt gcc cgc gaa ggg ctg gtc aat gtc gtg ggc ggt 960 
Gin Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly 
305 310 315 320 

tgc tgc ggc acc acc ccc gat cac ate cgc gcc ate gcg gaa gcc gtg 1008 
Cys Cys Gly Thr Thr Pro Asp His He Arg Ala He Ala Glu Ala Val 
325 330 335 

aaa cct ttc ccg ccg agg gcc ctg cca age cgt tat ctg cgc ctt teg 1056 
Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser 
340 345 350 

ggg ctt gag ccc ttt acc ctg acg ccc gac att ccc ttc gtg aac ate 1104 
Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp He Pro Phe Val Asn He 
355 360 365 

ggc gag cgc acg aat gtc acc ggc teg gcc egg ttc cgc aag atg ate 1152 
Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met He 
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370 375 380 

gtc gcc cgc gac tat gcc gcc gcg ctg gat gtc gcc cgc gat cag gtg 1200 
Val Ala Arg Asp Tyr Ala Ala Ala Leu Asp Val Ala Arg Asp , Gin Val 
385' 390 395 400 

gaa aac ggc gcg cag ate ctt gac ate aac atg gac gag ggg ctg ate 1248 
Glu Asn Gly Ala Gin lie Leu Asp He Asn Met Asp Glu Gly Leu He 
405 410 415 

gac agt cag gcg gcg atg gtc gcc ttc etc aac etc ttg gcc gcc gag 1296 
Asp Ser Gin Ala Ala Met Val Ala Phe Leu Asn Leu Leu Ala Ala Glu 
420 425 430 

ccc gac att gcc egg gtg ccg gtg atg ate gac age teg aaa tgg gag 1344 
Pro Asp He Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu 
435 440 445 

gtg ate gag gcc ggg ctg aaa tgc gtg cag ggc aag ccc gtc gtc aat 1392 
Val He Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Pro Val Val Asn 
450 455 460 

teg ate age ctg aag gag ggc gag gag ate ttc cgc cat cac gcg gcg 1440 
Ser He Ser Leu Lys Glu Gly Glu Glu He Phe Arg H1b His Ala Ala 
465 470 475 ~ 480 

ctg tgt ctg gcc tat ggc gcg gcg gtc gtc gtg atg gcc ttt gac gaa 1488 
Leu Cys Leu Ala Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu 
485 4 90 495 

gag ggg cag gcc gac agt ttc gcc cga aag acc age ate tgc gcc cgc 1536 
Glu Gly Gin Ala Asp Ser Phe Ala Arg Lys Thr Ser He Cy8 Ala Arg 
500 505 510 

gcc tat cgc att ctg gtc gag gag ate ggc ttt ccg ccc gaa gac ate 1584 
Ala Tyr Arg He Leu Val Glu Glu He Gly Phe Pro Pro Glu Asp He 
515 520 525 

ate ttt gac ccg aac gtc ttt gcc gtc gcc acg ggc ate gaa gaa cac 1632 
He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Glu Glu His 
530 535 540 

gac aat tac ggc gtt gat ttc ate gag gcc get cgc tgg ate egg gcc 1680 
Asp Asn Tyr Gly Val Asp Phe He Glu Ala Ala Arg Trp He Arg Ala 
545 550 555 560 

aac ctg ccg cat gcc cat gtc teg ggc ggg gtg teg aac ctg tec ttc 1728 
Asn Leu Pro His Ala His Val Ser Gly Gly Val Ser Asn Leu Ser Phe 
565 570 575 

age ttt cgc ggc aac gaa ccc gtg cgc gcg gcg atg cat gcg gtg ttt 1776 
Ser Phe Arg Gly Asn Glu Pro Val Arg Ala Ala Met His Ala Val Phe 
580 585 590 

ctt tac cac gcc ate cgc gcc ggg atg gat atg ggg ate gtc aat gcc 1824 
Leu Tyr His Ala He Arg Ala Gly Met Asp Met Gly He Val Asn Ala 
595 600 605 



ggg cag ctg gtg gtc tat gac cag ate gac ccc gag ctg cgc cag gcc 1872 
Gly Gin Leu Val Val Tyr Asp Gin He Asp Pro Glu Leu Arg Gin Ala 
610 615 620 
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tgc gag gat gtg gtg etc aac cgc cag ccc aaa teg ggc ggc ace gcg 1920 
Cys Glu Asp Val Val Leu Asn Arg Gin Pro Lye Ser Gly Gly Thr Ala 
625 630 635 640 

ace gag egg atg ctg gag gtg gee gag cgc ttc cgc ggc ggc gcg cgc 1968 
Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg 
645 650 " " 655 

gag gaa aag acc cgc gat ctg gec tgg cgc gac tgg ccg gtg gaa aag 2016 
Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys 
660 665 670 

egg etc gaa cat gcg ctg gtc aat ggc ate acc gaa ttc ate gag gee 2064 
Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe He Glu Ala 
675 680 685 

gat acc gaa gee gca agg ctt ctg gee gaa cgc ccg ctg cat gtg ate 2112 
Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val He 
690 695 700 

gaa ggg ccg ctg atg gcg ggg atg aat gtc gtc ggt gat ctg ttc ggc 2160 
Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly 
705 710 715 720 

gcg ggc aag atg ttc ctg cca cag gtg gtg aaa teg gcg cgc gtg atg 2208 
Ala Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met 
725 730 735 

aaa cag gee gtc gee gtt ctg ctg ccc tac atg gat gee gaa aag gee 2256 
Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala 
740 745 750 

gcg cgc ggc ggc gag ggg cgc gaa acc gcg ggc aag ate ctg atg gec 2304 
Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys He Leu Met Ala 
755 760 765 

acg gtc aag ggc gat gtg cat gac ate ggc aag aac ate gtc ggc gtc 2352 
Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val 
770 775 780 

gtg ctg gee tgc aac aat tac gac ate gtc gac ctg ggc gtg atg gtg 2400 
Val Leu Ala Cys Asn Asn Tyr Asp He Val Asp Leu Gly Val Met Val 
785 790 795 800 

ccg ccg caa aag ate ctg gaa gtg gcg egg gec gaa aag gtc gat gcg 2448 
Pro Pro Gin Lys He Leu Glu Val Ala Arg Ala Glu Lys Val Asp Ala 
805 810 815 

ate ggg ctt tec ggg ctg ate acg cca age ctg gac gag atg gtg cat 2496 
He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val His 
820 825 830 

ctg gec gcg gaa atg gag cgc gag ggc ttt gac att ccg ctg ctg ate 2544 
Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp He Pro Leu Leu He 
835 840 845 

ggc ggg gcg acc acg teg aaa gtg cat acg gcg gtg aag ate gec ccc 2592 
Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys He Ala Pro 
850 855 860 

gee tac age cgc ggg cag gcg gtt tat gtg etc gat gee age egg gee 2640 
Ala Tyr Ser Arg Gly Gin Ala Val Tyr Val Leu Asp Ala Ser Arg Ala 
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865 870 8?5 880 

gtg 999 9tg gtg ggg gcg ctt ttg age ccg aac cag aag gtc gat tac 2688 
Val Gly Val Val Gly Ala Leu Leu Ser Pro Aen Gin Lye Val. Asp Tyr 
885 890 895 

gcg gcg cag ate cgc gcg gac tat gcg cag ate gec gec cgt cat gec 2736 
Ala Ala Gin lie Arg Ala Asp Tyr Ala Gin He Ala Ala Arg Hie Ala 
900 90S 910 

cgc gac gag gec gec aag gtg egg ctg cct ttg gec gcg gec egg gec 2784 
Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala 
915 920 925 

aat gcg ctg egg etc gac tgg teg ggc tat gec gtg ccc gcg ccg caa 2832 
Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gin 
930 935 940 

ttc ctt ggc ccg cgc gtg ate gac gac tgg gat ctg gec gaa gtg gcg 2880 
Phe Leu Gly Pro Arg Val He Asp Asp Trp Asp Leu Ala Glu Val Ala 
945 950 955 960 

egg tat ate gac tgg acg ccc ttc ttc cat gec tgg gaa ttg aag ggg 2928 
Arg Tyr He Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly 
965 970 975 

gtc tat ccg egg att etc gat gac gec gaa aag ggc gaa gcg gcg egg 2976 
Val Tyr Pro Arg He Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg 
980 985 990 

gca ctt ttc gec gat gec cag gcg atg ctg gcg cag ate att gee gaa 3024 
Ala Leu Phe Ala Asp Ala Gin Ala Met Leu Ala Gin He lie Ala Glu 
995 1000 1005 

cgc tgg ttc ace ccg cgc gee gtg gtg ggg ttc tgg ccc gcg cag gcg 3072 
Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gin Ala 
1010 1015 1020 

gtg ggc gac gat ate egg ctt tac ace gac gag age egg acc gaa gac 3120 
Val Gly Asp Asp He Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp 
1025 1030 1035 1040 

etc gee act ttc ttc acc ctg cgc cag cag acc ggc aag cgc gaa ggc 3168 
Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Thr Gly Lys Arg Glu Gly 
1045 1050 1055 

cgc ccg aat gtg get ttg gec gat ttc gtc gcg cct gcg ggc acg gtg 3216 
Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val 
1060 1065 1070 

ccc gat tat ctg ggc ggc ttc gtg gtc acc gcg ggc ccc gag gaa gee 3264 
Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala 
1075 1080 1085 

gag ate gec gcg egg ttc gaa get gee aat gac cat tat tec gcg ate 3312 
Glu He Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala He 
1090 1095 1100 

ctg gtc aag gcg ctg gec gac cgc ttt gee gaa gee ctg gee gag gee 3360 
Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala 
1105 1110 1115 1120 
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ctg cat cag egg gtg egg cgc gac tat tgg ggc tat gcg ccc gaa gaa 3408 
Leu His Gin Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu 
1125 1130 1135 

age ttc gee ccc gat cag ctg gtg ggc gag ccc tat cgc ggc ate cgc 3456 
Ser Phe Ala Pro Asp Gin Leu Val Gly Glu Pro Tyr Arg Gly lie Arg 
1140 1145 1150 

ccg gcg ccc ggc tat ccg gec cag ccc gac cac acg gaa aag ctg acg 3504 
Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Leu Thr 
1155 1160 1165 

ctg ttc egg ctg ctt ggg gee gag gee gcg acc ggc gtg cat ctg acc 3552 
Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr 
1170 1175 1180 

gac age atg gcg atg tgg ccc ggc tct teg gtc teg ggg etc tat ate 3600 
Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr He 
1185 1190 1195 * 1200 

ggc cat ccg gag gee tat tat ttc ggt ctg gee egg ate gag cag gat 3648 
Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg He Glu Gin Asp 
1205 1210 1215 

cag gee gee gat tac gee gee cgc aag ggc atg gee ttg gec gag gtg 3696 
Gin Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val 
1220 1225 1230 

cag cgc tgg ctg gee ccg gtg ctg ggg teg gec gcg ccc gec gee get 3744 
Gin Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala 
1235 1240 1245 



gcg gtg gec gcg tga 
Ala Val Ala Ala 
1250 



3759 



<210> 42 
<211> 1252 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 42 

Met Leu Thr Gin Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala He Glu 
15 10 15 

Ala Leu Ser Arg Gin Arg He Leu He Leu Asp Gly Ala Met Gly Thr 
20 25 30 

Gin lie Gin Gin Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly 
35 40 45 

Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gin Lys Gly Asn 
50 55 60 

Asn Asp Leu Leu Val Leu Thr Gin Pro Gin Ala He Glu Glu He His 
65 70 75 80 

Phe Arg Tyr Ala Met Ala Gly Ala Asp He Val Glu Thr Asn Thr Phe 
85 90 95 



Ser Ala Thr Thr He Ala Gin Ala Asp Tyr Gly Leu Glu Ser Ala Val 
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100 105 110 

Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp 
115 120 125 

Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ala Gly Ala Val 
130 135 140 

Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro 
145 150 155 160 

Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gin 
165 170 175 

Gin Val Arg Gly Leu lie Ala Gly Gly Ala Asp He Leu Leu He Glu 
180 185 190 

Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Cys Phe 
195 200 205 

Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met He Ser Gly 
210 215 220 

Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro Thr 
225 230 235 240 

Ala Phe Trp His Ser Val Ala His Ala Arg Pro Phe Thr Val Gly Leu 
245 250 255 

Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu 
260 265 270 

Ala Gly Val Ala Pro Cys Ala He Cys Ala Tyr Pro Asn Ala Gly Leu 
275 280 2B5 

Pro Asn Ala Phe Gly Gin Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala 
290 295 300 

Gin Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly 
305 310 315 320 

Cys Cys Gly Thr Thr Pro Asp His He Arg Ala He Ala Glu Ala Val 
325 330 335 

Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser 
340 345 ~ 350 

Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp He Pro Phe Val ABn He 
355 360 365 

Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met He 
370 375 360 

Val Ala Arg Asp Tyr Ala Ala Ala Leu Asp Val Ala Arg Asp Gin Val 
385 390 395 400 

Glu Asn Gly Ala Gin He Leu Asp He Asn Met Asp Glu Gly Leu He 
405 410 415 

Asp Ser Gin Ala Ala Met Val Ala Phe Leu Asn Leu Leu Ala Ala Glu 
420 425 430 
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Pro Asp lie Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu 
435 440 445 

Val He Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Pro Val Val Asn 
450 455 460 

Ser He Ser Leu Lys Glu Gly Glu Glu He Phe Arg His His Ala Ala 
465 470 475 480 

Leu Cya Leu Ala Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu 
485 490 495 

Glu Gly Gin Ala Asp Ser Phe Ala Arg Lye Thr Ser He Cys Ala Arg 
500 505 510 

Ala Tyr Arg He Leu Val Glu Glu He Gly Phe Pro Pro Glu Asp He 
515 520 525 

He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Glu Glu His 
530 535 540 

Asp Asn Tyr Gly Val Asp Phe He Glu Ala Ala Arg Trp He Arg Ala 
545 550 555 560 

Asn Leu Pro His Ala His Val Ser Gly Gly Val Ser Asn Leu Ser Phe 
565 570 575 

Ser Phe Arg Gly Asn Glu Pro Val Arg Ala Ala Met His Ala Val Phe 
580 585 590 

Leu Tyr His Ala He Arg Ala Gly Met Asp Met Gly He Val Asn Ala 
595 600 605 

Gly Gin Leu Val Val Tyr Asp Gin He Asp Pro Glu Leu Arg Gin Ala 
610 615 620 

Cys Glu Asp Val Val Leu Asn Arg Gin Pro Lys Ser Gly Gly Thr Ala 
625 630 635 640 

Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg 
645 650 " 655 

Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys 
660 665 670 

Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe He Glu Ala 
675 680 685 

Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val He 
690 695 700 

Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly 
705 710 715 720 

Ala Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met 
725 730 735 

Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala 
740 745 750 

Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys He Leu Met Ala 
755 760 765 
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Thr Val Lys Gly Asp Val HiB Asp lie Gly Lys Asn He Val Gly Val 
770 775 780 

Val Leu Ala Cys Asn Asn Tyr Asp lie Val Asp Leu Gly Val Met Val 
785 790 795 , 800 

Pro Pro Gin Lys He Leu Glu val Ala Arg Ala Glu Lys Val Asp Ala 
805 810 815 

He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val His 
820 825 830 

Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp He Pro Leu Leu He 
835 840 845 

Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys He Ala Pro 
850 855 860 

Ala Tyr Ser Arg Gly Gin Ala Val Tyr Val Leu Asp Ala Ser Arg Ala 
865 870 875 880 

Val Gly Val Val Gly Ala Leu Leu Ser Pro Asn Gin Lys Val Asp Tyr 
885 890 895 

Ala Ala Gin lie Arg Ala Asp Tyr Ala Gin He Ala Ala Arg His Ala 
900 905 910 

Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala 
915 920 925 

Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gin 
930 935 940 

Phe Leu Gly Pro Arg Val He Asp Asp Trp Asp Leu Ala Glu Val Ala 
945 950 955 960 

Arg Tyr He Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly 
965 970 975 

Val Tyr Pro Arg lie Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg 
980 985 * 990 

Ala Leu Phe Ala Asp Ala Gin Ala Met Leu Ala Gin lie lie Ala Glu 
995 1000 1005 

Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gin Ala 
1010 1015 1020 

Val Gly Asp Asp lie Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp 
1025 1030 1035 1040 

Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Thr Gly Lys Arg Glu Gly 
1045 1050 1055 

Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val 
1060 1065 1070 

Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala 
1075 1080 1085 

Glu He Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala He 
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1090 1095 1100 

Leu Val Lye Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala 
1105 mo ins 1120 

Leu His Gin Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu 
1125 1130 U35 

Ser Phe Ala Pro Asp Gin Leu Val Gly Glu Pro Tyr Arg Gly He Arg 
1140 1145 1150 

Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Leu Thr 
1155 1160 1165 

Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr 
1170 1175 HBO 

Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr He 
1185 1190 1195 1200 

Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg He Glu Gin Asp 
1205 1210 1215 

Gin Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val 
1220 1225 1230 

Gin Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala 
1235 1240 1245 

Ala Val Ala Ala 
1250 



<210> 43 
<211> 3798 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> <1)..(3795) 

<223> RHS24705 

<400> 43 

atg tea ccc gcg etc caa gac ctg teg caa ccc gaa ggt ctg aag aaa 48 

Met Ser Pro Ala Leu Gin Asp Leu Ser Gin Pro Glu Gly Leu Lys Lys 
1 5 10 is 

acc ctg egg gat gag ate aat gee att ctg cag aag agg att atg gtg 96 
Thr Leu Arg Asp Glu lie Asn Ala lie Leu Gin Lys Arg lie Met Val 
20 25 30 

ctg gat gga ggg atg ggg acc atg ate cag egg gag aag eta aac gaa 144 
Leu Asp Gly Gly Met Gly Thr Met He Gin Arg Glu Lys Leu Asn Glu 
35 40 45 

gaa cac ttc cga ggt cag gaa ttt aaa gat cat gee agg ccg ctg aaa 192 
Glu His Phe Arg Gly Gin Glu Phe Lys Asp His Ala Arg Pro Leu Lys 
50 55 60 



ggc aac aat gac att tta agt ata act cag cct gat gtc att tac caa 240 
Gly Asn Asn Asp lie Leu Ser lie Thr Gin Pro Asp Val He Tyr Gin 
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65 70 75 80 

ate cat aag gaa tac ttg ctg get ggg gca gat ate att gaa aca aat 288 
lie His Lys Glu Tyr Leu Leu Ala Gly Ala Asp lie lie Glu Thr Asn 
85 90 9S 

act ttt age age act agt att gee caa get gac tat ggc ctt gaa cac 336 
Thr Phe Ser Ser Thr Ser lie Ala Gin Ala Asp Tyr Gly Leu Glu His 
100 105 no 

ttg gee tac egg atg aac atg tgc tct gca gga gtg gec aga aaa get 384 
Leu Ala Tyr Arg Met Asn Met Cys Ser Ala Gly Val Ala Arg Lys Ala 
115 120 125 

gee gag gag gta act etc cag aca gga att aag agg ttt gtg gca ggg 432 
Ala Glu Glu Val Thr Leu Gin Thr Gly lie Lys Arg Phe Val Ala Gly 
130 135 * 140 

get ctg ggt ccg act aat aag aca etc tct gtg tec cca tct gtg gaa 480 
Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu 
145 150 155 160 

agg ccg gat tat agg aac ate aca ttt gat gag ctt gtt gaa gca tac 528 
Arg Pro Asp Tyr Arg Asn He Thr Phe Asp Glu Leu Val Glu Ala Tyr 
165 170 175 

caa gag cag gee aaa gga ctt ctg gat ggc ggg gtt gat ate tta etc 576 
Gin Glu Gin Ala Lys Gly Leu Leu Asp Gly Gly Val Asp He Leu Leu 
180 185 190 

att gaa act att ttt gat act gee aat gee aag gca gee ttg ttt gca 624 
lie Glu Thr He Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala 
195 200 205 

etc caa aat ctt ttt gag gag aaa tat get ccc egg cct ate ttt att 672 
Leu Gin Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro He Phe He 
210 215 220 

tea ggg acg ate gtt gat aaa agt ggg egg act ctt tec gga cag aca 720 
Ser Gly Thr He Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gin Thr 
225 230 235 * 240 

gga gag gga ttt gtc ate age gtg tct cat gga gaa cca etc tgc att 768 
Gly Glu Gly Phe Val He Ser Val Ser His Gly Glu Pro Leu Cys He 
245 250 255 

gga tta aat tgt get ttg ggt gca get gaa atg aga cct ttt att gaa 816 
Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe He Glu 
260 265 " 270 

ata att gga aaa tgt aca aca gee tat gtc etc tgt tat ccc aat gca 864 
He He Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala 
275 280 285 



ggt ctt ccc aac ace ttt ggt gac tat gat gaa acg cct tct atg atg 
Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met 
290 295 300 



912 



gee aag cac eta aag gat ttt get atg gat ggc ttg gtc aat ata gtt 960 
Ala Lys His Leu Lys Asp Phe Ala Met Asp Gly Leu Val Asn He Val 
305 310 315 320 
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gga gga tgc tgt ggg tea aca cca gat cat ate agg gaa att get gaa 1008 
Gly Gly Cys Cys Gly Ser Thr Pro Asp Hie lie Arg Glu lie Ala Glu 
325 330 335 

get gtg aaa aat tgt aag cct aga gtt cca cct gee act get ttt gaa 1056 
Ala Val Lys Asn Cys Lys Pro Arg Val Pro Pro Ala Thr Ala Phe Glu 
340 345 350 

gga cat atg tta ctg tct ggt eta gag ccc ttc agg att gga ccg tac 1104 
Gly His Met Leu Leu Ser Gly Leu Glu Pro Phe Arg lie Gly Pro Tyr 
355 360 365 

ace aac ttt gtt aac att gga gag cgc tgt aat gtt gca gga tea agg 1152 
Thr Asn Phe Val Asn lie Gly Glu Arg Cys Asn Val Ala Gly Ser Arg 
370 375 380 

aag ttt get aaa etc ate atg gca gga aac tat gaa gaa gee ttg tgt 1200 
Lys Phe Ala Lys Leu lie Met Ala Gly Asn Tyr Glu Glu Ala Leu Cys 
385 390 395 400 

gtt gec aaa gtg cag gtg gaa atg gga gec cag gtg ttg gat gtc aac 1248 
Val Ala Lys Val Gin Val Glu Met Gly Ala Gin Val Leu Asp Val Asn 
405 410 415 

atg gat gat ggc atg eta gat ggt cca agt gca atg acc aga ttt tgc 1296 
Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cys 
420 425 430 

aac tta att get tec gag cca gac ate gca aag gta cct ttg tgc ate 1344 
Asn Leu He Ala Ser Glu Pro Asp He Ala Lys Val Pro Leu Cys He 
435 440 445 

gac tec tec aat ttt get gtg att gaa get ggg tta aag tgc tgc caa 1392 
Asp Ser Ser Asn Phe Ala Val He Glu Ala Gly Leu Lys Cys Cys Gin 
450 455 460 

ggg aag tgc att gtc aat age att agt ctg aag gaa gga gag gac gac 1440 
Gly Lys Cys He Val Asn Ser He Ser Leu Lys Glu Gly Glu Asp Asp 
465 470 475 480 

ttc ttg gag aag gec agg aag att aaa aag tat gga get get atg gtg 1488 
Phe Leu Glu Lys Ala Arg Lys He Lys Lys Tyr Gly Ala Ala Met Val 
485 490 495 

gtc atg get ttt gat gaa gaa gga cag gca aca gaa aca gac aca aaa 1536 
Val Met Ala Phe Asp Glu Glu Gly Gin Ala Thr Glu Thr Asp Thr Lys 
500 505 510 

ate aga gtg tgc acc egg gee tac cat ctg ctt gtg aaa aaa ctg ggc 1584 
He Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys LyB Leu Gly 
515 520 525 

ttt aat cca aat gac att att ttt gac cct aat ate eta acc att ggg 1632 
Phe Asn Pro Asn Asp He He Phe Asp Pro Asn He Leu Thr He Gly 
530 535 540 

act gga atg gag gaa cac aac ttg tat gee att aat ttt ate cat gca 1680 
Thr Gly Met Glu Glu His Asn Leu Tyr Ala He Asn Phe He His Ala 
545 550 555 560 

aca aaa gtc att aaa gaa aca tta cct gga gee aga ata agt gga ggt 1728 
Thr Lys Val He Lys Glu Thr Leu Pro Gly Ala Arg He Ser Gly Gly 
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565 570 575 

ctt tec aac ttg tec ttc tec ttc cga gga atg gaa gec att cga gaa 1776 
Leu Ser Asn Leu Ser Phe Ser Phe Arg Gly Met Glu Ala He ,Arg Glu 
580 585 590 

gca atg cat ggg gtt ttc ctt tac cat gca ate aag tct ggc atg gac 1824 
Ala Met His Gly Val Phe Leu Tyr His Ala He Lys Ser Gly Met Asp 
595 600 605 

atg ggg ata gtg aat get gga aac etc cct gtg tat gat gat ate cat 1872 
Met Gly He Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp He His 
610 615 620 

aag gaa ctt ctg cag etc tgt gaa gat etc ate tgg aat aaa gac cct 1920 
Lys Glu Leu Leu Gin Leu Cys Glu Asp Leu He Trp Asn Lys Asp Pro 
625 630 635 640 

gag gec act gag aag etc tta cgt tat gee cag act caa ggc aca gga 1968 
Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gin Thr Gin Gly Thr Gly 
645 650 655 

ggg aag aaa gtc att cag act gat gag tgg aga aat ggc cct gtc gaa 2016 
Gly Lys Lys Val He Gin Thr Asp Glu Trp Arg Asn Gly Pro Val Glu 
660 665 670 

gaa cgc ctt gag tat gee ctt gtg aag ggc att gaa aaa cat att att 2064 
Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly He Glu Lys His, He He 
675 680 685 

gag gat act gag gaa gee agg tta aac caa aaa aaa tat ccc cga cct 2112 
Glu Asp Thr Glu Glu Ala Arg Leu Asn Gin Lys Lys Tyr Pro Arg Pro 
690 695 700 

etc aat ata att gaa gga ccc ctg atg aat gga atg aaa att gtt ggt 2160 
Leu Asn He He Glu Gly Pro Leu Met Asn Gly Met Lys He Val Gly 
705 710 715 720 

gat ctt ttt gga get gga aaa atg ttt eta cct cag gtt ata aag tea 2208 
Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val He Lys Ser 
725 730 735 

gee egg gtt atg aag aag get gtt ggc cac ctt ate cct ttc atg gaa 2256 
Ala Arg Val Met Lys Lys Ala Val Gly His Leu He Pro Phe Met Glu 
740 745 750 



aaa gaa aga gaa gaa ace aga gtg ctt aac ggc aca gta gaa gaa gag 2304 
Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu 
755 760 765 

gac cct tac cag ggc ace ate gtg ctg gee act gtt aaa ggc gac gtg 2352 
Asp Pro Tyr Gin Gly Thr He Val Leu Ala Thr Val Lys Gly Asp Val 
770 775 780 

cac gac ata ggc aag aac ata gtt gga gta gtc ctt ggc tgc aat aat 2400 
His Asp He Gly Lys Asn He Val Gly Val Val Leu Gly Cys Asn Asn 
785 790 795 800 

ttc cga gtt att gat tta gga gtc atg act cca tgt gat aag ata ctg 2448 
Phe Arg Val He Asp Leu Gly Val Met Thr Pro Cys Asp Lys He Leu 
805 810 815 
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aaa get get ctt gac cac aaa gca gat ata att ggc ctg tea gga etc 2496 
Lys Ala Ala Leu Asp His Lys Ala Asp He He Qly Leu Ser Gly Leu 
820 825 830 

ate act cct tec ctg gat gaa atg att ttt gtt gee aag gaa atg gag 2544 
lie Thr Pro Ser Leu Asp Glu Met He Phe Val Ala Lys Glu Met Glu 
835 840 845 

aga tta get ata agg att cca ttg ttg att gga gga gca ace act tea 2592 
Arg Leu Ala He Arg He Pro Leu Leu He Gly Gly Ala Thr Thr Ser 
850 855 660 

aaa ace cac aca gca gtt aaa ata get ccg aga tac agt gca cct gta 2640 
Lye Thr His Thr Ala Val Lys He Ala Pro Arg Tyr Ser Ala Pro Val 
865 870 875 880 

ate cat gtc ctg gac gcg tec aag agt gtg gtg gtg tgt tec cag ctg 2688 
He His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gin Leu 
885 890 895 

tta gat gaa aat eta aag gat gaa tac ttt gag gaa ate atg gaa gaa 2736 
Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu He Met Glu Glu 
900 905 910 

tat gaa gat att aga cag gac cat tat gag tct etc aag gag agg aga 2784 
Tyr Glu Asp He Arg Gin Asp His Tyr Glu Ser Leu Lys Glu Arg Arg 
915 920 925 

tac tta ccc tta agt caa gee aga aaa agt ggt ttc caa atg gat tgg 2832 
Tyr Leu Pro Leu Ser Gin Ala Arg Lys Ser Gly Phe Gin Met Asp Trp 
930 935 940 

ctg tct gaa cct cac cca gtg aag ccc acg ttt att ggg acc cag gtc 2880 
Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe He Gly Thr Gin Val 
945 950 955 960 

ttt gaa gac tat gac ctg cag aag ctg gtg gac tac att gac tgg aag 2928 
Phe Glu Asp Tyr Asp Leu Gin Lys Leu Val Asp Tyr He Asp Trp Lys 
965 970 975 

cct ttc ttt gat gtc tgg cag etc egg ggc aag tac ccg aat cga ggc 2976 
Pro Phe Phe Asp Val Trp Gin Leu Arg Gly Lys Tyr Pro Asn Arg Gly 
980 985 * 990 

ttt ccc aag ata ttt aac gac aaa aca gta ggt gga gag gee agg aag 3024 
Phe Pro LyB He Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys 
995 1000 1005 

gtc tac gat gat gee cac aat atg ctg aac aca ctg att agt caa aag 3072 
Val Tyr Asp Asp Ala His Asn Met Leu Asn Thr Leu He Ser Gin Lys 
1010 1015 1020 

aaa etc egg gee egg ggt gtg gtt ggg ttc tgg cca gca cag agt ate 3120 
Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gin Ser He 
1025 1030 1035 1040 

caa gac gac att cac ctg tac gcg gag get get gtg ccc cag get gca 3168 
Gin Asp Asp He His Leu Tyr Ala Glu Ala Ala Val Pro Gin Ala Ala 
1045 1050 1055 

gag ccc ata gee acc ttc tat ggg tta agg caa cag get gag aag gac 3216 
Glu Pro He Ala Thr Phe Tyr Gly Leu Arg Gin Gin Ala Glu Lys Asp 
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1060 1065 1070 

tct gcc age acg gag cca tac tac tgc etc tea gac ttc ate get ccc 3264 
Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe lie Ala Pro 
107S 1080 1085 

ttg cat tct ggc ate cgt gac tac ctg ggc ctg ttt gcc gtt gcc tgc 3312 
Leu His Ser Gly lie Arg Asp Tyr Leu Gly Leu Phe Ala Val Ala Cys 
1090 1095 1100 

ttt ggg gta gaa gag ctg age aag gcc tat gag gat gat ggt gac gac 3360 
Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Asp 
1105 1110 1115 1120 

tac age age ate atg gtc aag gcg ctg ggg gac egg ctg gca gag gcc 3408 
Tyr Ser Ser He Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala 
1125 1130 *" 1135 

ttt gca gaa gag etc cat gaa aga gtt cgc cga gaa ctg tgg gcc tac 3456 
Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr 
1140 1145 1150 

tgt ggc agt gag cag ctg gac gtc gca gac ctg cgc agg ctg egg tac 3504 
Cys Gly Ser Glu Gin Leu Asp Val Ala Asp Leu Arg Arg Leu Arg Tyr 
1155 1160 1165 

aag ggc ate cgc ccg get cct ggc tac ccc age cag ccc gac cac ace 3552 
Lys Gly He Arg Pro Ala Pro Gly Tyr Pro Ser Gin Pro Asp His Thr 
1170 1175 1180 

gag aag etc acc atg tgg aga ctt gca gac ate gag cag tct aca ggc 3600 
Glu Lys Leu Thr Met Trp Arg Leu Ala Asp He Glu Gin Ser Thr Gly 
1185 1190 1195 1200 

att agg tta aca gaa tea tta gca atg gca cct get tea gca gtc tea 3648 
He Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser 
1205 1210 1215 

ggc etc tac ttc tec aat ttg aag tec aaa tat ttt get gtg ggg aag 3696 
Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys 
1220 1225 1230 

att tec aag gat cag gtt gag gat tat gca ttg agg aag aac ata tct 3744 
lie Ser Lys Asp Gin Val Glu Asp Tyr Ala Leu Arg Lys Asn He Ser 
1235 1240 1245 

gtg get gag gtt gag aaa tgg ctt gga ccc att ttg gga tat gat aca 3792 
Val Ala Glu Val Glu Lys Trp Leu Gly Pro He Leu Gly Tyr Asp Thr 
1250 1255 1260 

gac taa 3798 

Asp 

1265 



<210> 44 
<211> 1265 
<212> PRT 

<213> Homo sapiens 
<400> 44 

Met Ser Pro Ala Leu Gin Asp Leu Ser Gin Pro Glu Gly Leu Lys Lys 



( 
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15 10 15 

Thr Leu Arg Asp Glu He Asn Ala He Leu Gin Lys Arg He Met Val 
20 25 30 

Leu Asp Gly Gly Met Gly Thr Met He Gin Arg Glu Lys Leu Asn Glu 
35 40 45 

Glu His Phe Arg Gly Gin Glu Phe Lys Asp His Ala Arg Pro Leu Lys 
50 55 60 

Gly Asn Asn Asp He Leu Ser He Thr Gin Pro Asp Val He Tyr Gin 
65 70 75 80 

He His Lys Glu Tyr Leu Leu Ala Gly Ala Asp He He Glu Thr Asn 
85 90 95 

Thr Phe Ser Ser Thr Ser He Ala Gin Ala Asp Tyr Gly Leu Glu His 
100 105 110 

Leu Ala Tyr Arg Met Asn Met Cys Ser Ala Gly Val Ala Arg Lys Ala 
115 120 125 

Ala Glu Glu Val Thr Leu Gin Thr Gly He Lys Arg Phe Val Ala Gly 
130 135 140 

Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu 
145 150 155 160 

Arg Pro Asp Tyr Arg Asn He Thr Phe Asp Glu Leu Val Glu Ala Tyr 
165 170 175 

Gin Glu Gin Ala Lys Gly Leu Leu Asp Gly Gly Val Asp He Leu Leu 
180 185 190 

He Glu Thr He Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala 
195 200 205 

Leu Gin Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro He Phe He 
210 215 220 

Ser Gly Thr He Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gin Thr 
225 230 235 240 

Gly Glu Gly Phe Val He Ser Val Ser His Gly Glu Pro Leu Cys He 
245 250 255 

Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe He Glu 
260 265 270 

He He Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala 
275 280 285 

Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met 
290 295 300 

Ala Lys His Leu Lys Asp Phe Ala Met Asp Gly Leu Val Asn He Val 
305 310 315 320 

Gly Gly Cys Cys Gly Ser Thr Pro Asp His He Arg Glu He Ala Glu 
325 330 335 
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Ala Val Lye Asn Cys Lys Pro Arg Val Pro Pro Ala Thr Ala Phe Glu 
340 345 35 0 

Gly Hi b Met Leu Leu Ser Gly Leu Glu Pro Phe Arg lie Gly, Pro Tyr 
355 360 365 

Thr Asn Phe Val Asn He Gly Glu Arg Cys Asn Val Ala Gly Ser Arg 
370 375 300 

Lys Phe Ala Lys Leu He Met Ala Gly. Asn Tyr Glu Glu Ala Leu Cys 
385 390 395 400 

Val Ala Lys Val Gin Val Glu Met Gly Ala Gin Val Leu Asp Val Asn 
405 410 415 

Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cys 
420 425 430 

Asn Leu He Ala Ser Glu Pro Asp lie Ala Lys Val Pro Leu Cys He 
435 440 445 

Asp ser Ser Asn Phe Ala Val He Glu Ala Gly Leu Lys Cys Cys Gin 
450 455 460 

Gly Lys Cys He Val Asn Ser He Ser Leu Lys Glu Gly Glu Asp Asp 
465 470 475 4B0 

Phe Leu Glu LyB Ala Arg Lys He Lys Lys Tyr Gly Ala Ala Met Val 
485 490 495 

Val Met Ala Phe Asp Glu Glu Gly Gin Ala Thr Glu Thr Asp Thr Lys 
500 505 510 

He Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys Lys Leu Gly 
515 520 525 

Phe Asn Pro Asn Asp He He Phe Asp Pro Asn He Leu Thr He Gly 
530 535 540 

Thr Gly Met Glu Glu His Asn Leu Tyr Ala He Asn Phe He His Ala 
545 550 555 560 

Thr Lys Val He Lys Glu Thr Leu Pro Gly Ala Arg He Ser Gly Gly 
565 570 575 

Leu Ser Asn Leu Ser Phe Ser Phe Arg Gly Met Glu Ala He Arg Glu 
580 585 590 

Ala Met His Gly Val Phe Leu Tyr His Ala He Lys Ser Gly Met Asp 
595 600 605 

Met Gly He Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp He His 
610 615 620 

Lys Glu Leu Leu Gin Leu Cys Glu Asp Leu He Trp Asn Lys Asp Pro 
625 630 635 640 

Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gin Thr Gin Gly Thr Gly 
645 650 655 

Gly Lys Lys Val He Gin Thr Asp Glu Trp Arg Asn Gly Pro Val Glu 
660 665 670 
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Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly He Glu Lys His He He 
675 680 685 

Glu Asp Thr Glu Glu Ala Arg Leu Asn Gin Lys Lys Tyr Pro Arg Pro 
690 695 700 

Leu Asn He He Glu Gly Pro Leu Met Aen Gly Met Lys He Val Gly 
705 710 715 720 

Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val He Lys Ser 
725 730 735 

Ala Arg Val Met Lys Lys Ala Val Gly His Leu He Pro Phe Met Glu 
740 745 750 

Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu 
755 760 765 

Asp Pro Tyr Gin Gly Thr He Val Leu Ala Thr Val LyB Gly Asp Val 
770 775 780 

His Asp He Gly Lys Asn He Val Gly Val Val Leu Gly Cys Asn Asn 

_. 7 M 790 795 . 800 

Phe Arg Val He Asp Leu Gly Val Met Thr Pro Cys Asp Lys He Leu 
805 810 815 

i 

Lys Ala Ala Leu Asp His Lys Ala Asp lie He Gly Leu Ser Gly Leu 
820 825 " 830 

He Thr Pro Ser Leu Asp Glu Met He Phe Val Ala Lys Glu Met Glu 
835 840 845 

Arg Leu Ala He Arg He Pro Leu Leu He Gly Gly Ala Thr Thr Ser 
850 855 860 

Lys Thr His Thr Ala Val Lys He Ala Pro Arg Tyr Ser Ala Pro Val 
865 870 875 880 

lie His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gin Leu 
885 890 895 

Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu He Met Glu Glu 
900 905 910 

Tyr Glu Asp He Arg Gin Asp His Tyr Glu Ser Leu Lys Glu Arg Arg 
915 920 925 

Tyr Leu Pro Leu Ser Gin Ala Arg Lys Ser Gly Phe Gin Met Asp Trp 
930 935 940 

Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe He Gly Thr Gin Val 
945 950 955 960 

Phe Glu Asp Tyr Asp Leu Gin Lys Leu Val Asp Tyr He Asp Trp Lys 
965 970 975 

Pro Phe Phe Asp Val Trp Gin Leu Arg Gly Lys Tyr Pro Asn Arg Gly 
980 985 990 

Phe Pro Lys He Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys 
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995 1000 1005 

Val Tyr Asp Asp Ala His Asn Met Leu Asn Thr Leu lie Ser Gin Lys 
1010 1015 1020 

Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gin Ser He 
102 5 1030 1035 1040 

Gin Asp Asp He His Leu Tyr Ala Glu Ala Ala Val Pro Gin Ala Ala 
X045 1050 .,-1055 

Glu Pro He Ala Thr Phe Tyr Gly Leu Arg Gin Gin Ala Glu Lys Asp 
1060 1065 1070 

Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe He Ala Pro 
1075 1080 1085 

Leu His Ser Gly He Arg Asp Tyr Leu Gly Leu Phe Ala Val Ala Cys 
1090 1095 1100 

Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Asp 
IHO 1115 1120 

Tyr Ser Ser He Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala 
1125 1130 H35 

Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr 
1140 H45 " ~ us© 

Cys Gly Ser Glu Gin Leu Asp Val Ala Asp Leu Arg Arg Leu Arg Tyr 
1155 H60 H65 

Lys Gly He Arg Pro Ala Pro Gly Tyr Pro Ser Gin Pro Asp His Thr 
1170 H75 1180 

Glu Lys Leu Thr Met Trp Arg Leu Ala Asp He Glu Gin Ser Thr Gly 
1185 1190 H95 1200 

He Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser 
1205 1210 1215 

Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys 
1220 1225 1230 

He Ser Lys Asp Gin Val Glu Asp Tyr Ala Leu Arg Lys Asn He Ser 
1235 1240 1245 

Val Ala Glu Val Glu Lys Trp Leu Gly Pro He Leu Gly Tyr Asp Thr 
1250 1255 1260 

Asp 
1265 



<210> 45 
<211> 3681 
<212> DNA 

<213> Vibrio fisheri 

<220> 

<221> CDS 

<222> (1) . . (3678) 
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<223> AB039955 
<400> 45 

gtg gca gga age aat ata aaa gta caa ata gaa aag caa ctt tea gag 48 
Val Ala Gly Ser Asn lie Lys Val Gin lie Glu Lys Gin Leu Ser Glu 
1 5 io is 



cga att tta ttg att gat ggt ggt atg ggc ace atg att caa ggt tat 
Arg He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Gly Tyr 
20 25 30 



96 



aag ttt gaa gag aaa gat tat aga ggg gga cgc ttt aat caa tgg cat 144 
Lys Phe Glu Glu Lye Asp Tyr Arg Gly Gly Arg Phe Asn Gin Trp His 
35 40 45 

tgt gat ctt aaa ggt aac aat gat tta tta gtt ctt tea caa cca caa 192 
Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gin Pro Gin 
50 55 60 

att ata aga gat ata cac gaa gee tat tta gaa get ggt get gat ate 240 
He He Arg Asp He His Glu Ala Tyr Leu Glu Ala Gly Ala Asp He 
65 70 75 80 

ctt gaa act aat ace ttt aat gca aca act att get atg get gat tat 288 
Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr He Ala Met Ala Asp Tyr 
85 90 95 

gat atg gaa age ctt agt gaa gag att aac ttt gaa gca gca aag ctt 336 
Asp Met Glu Ser Leu Ser Glu Glu He Asn Phe Glu Ala Ala Lys Leu 
100 105 110 

get cgt gaa gtt gca gat aaa tgg aca gaa aaa aca cca aac aaa cct 384 
Ala Arg Glu Val Ala Asp Lys Trp Thr Glu Lys Thr Pro Asn Lys Pro 
115 120 125 

cgc tat gta gca gga gtg ctt gga cca aca aat cga act tgt tct att 432 
Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He 
130 135 140 

tct cca gac gta aat gac cct ggc ttt cgt aat gta teg ttt gat gaa 480 
Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu 
145 150 155 160 

tta gtc gaa get tat tea gag tea act cga gca ctt att aga ggt ggt 528 
Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Arg Gly Gly 
165 170 175 

tea gat ctt ate etc ate gaa act ata ttt gat aca tta aat get aaa 576 
Ser Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

gcg tgt tct ttt get gtt gaa tct gtt ttt gaa gag ctt ggt att act 624 
Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly He Thr 
195 200 205 

ttg cct gtt atg att tea ggg acc att acc gat gca tea gga aga aca 672 
Leu Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr 
210 215 220 

tta teg ggg caa aca aca gaa get ttt tat aat gca tta aga cat gta 720 
Leu Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val 
225 2 30 235 240 
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aaa cct att tct ttt ggt ctt aac tgt gca ctt ggt cct gat gaa tta 768 
Lys Pro He Ser Phe Gly Leu Asn CyB Ala Leu Gly Pro Asp Glu Leu 
24 5 250 ,255 

cgt gaa tat gta age gag ctt tea cgt att tct gaa tgt tat gtt tct 816 
Arg Glu Tyr Val Ser Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser 
260 265 270 

gcg cac cca aac get ggt ttg cct aat gca ttt ggt gag tat gat tta 864 
Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu 
275 280 205 

tct ccc gaa gat atg get gag cat gtt gcg gaa tgg gca age age gga 912 
Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly 
290 295 300 

ttt tta aat ctt att ggt ggg tgt tgt ggc ace act cct gaa cat att 960 
Phe Leu Asn Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He 
305 310 315 320 

cgt caa atg get tta gtt gtt gaa ggt gtg aaa cct cga caa tta cct 1008 
Arg Gin Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gin Leu Pro 
325 330 ~ 335 

gaa tta ccc gta get tgt cgt ctt tec gga tta gag cct tta aca ata 1056 
Glu Leu Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr He 
340 345 350 

gaa aaa gat tct ttg ttt att aat gtt ggt gaa cgt aca aat gtt act 1104 
Glu Lys Asp Ser Leu Phe He Asn Val Gly Glu Arg Thr Asn Val Thr 
355 360 365 

gga tct gca cgt ttt aaa cgc tta att aaa gaa gag ctt tat gac gaa 1152 
Gly Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Leu Tyr Asp Glu 
370 375 380 

gca eta agt gtt get caa gag caa gtt gaa aac ggt get caa att ate 1200 
Ala Leu Ser Val Ala Gin Glu Gin Val Glu Asn Gly Ala Gin He He 
385 390 395 400 

gat ate aac atg gat gaa ggc atg ctt gat get gaa gca tgt atg gtt 1248 
Asp He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val 
405 410 415 

cgt ttt tta aat ctt tgt gca tea gaa cct gaa ata tct aaa gta cca 1296 
Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro 
420 425 430 

gtg atg gtt gat tct tct aaa tgg gaa gta att gaa get gga tta aag 1344 
Val Met Val Asp Ser Ser Lys Trp Glu Val lie Glu Ala Gly Leu Lys 
435 440 445 

tgt att caa ggt aag ggg ata gtt aat tea ate tct tta aag gaa ggc 1392 
Cys lie Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lys Glu Gly 
450 455 460 

aaa gaa aag ttt gta cat caa gee aag tta ata cgt cgt tat ggt get 1440 
Lys Glu Lys Phe Val His Gin Ala Lys Leu lie Arg Arg Tyr Gly Ala 
465 470 475 480 

gca gtg ate gtt atg get ttt gat gaa gtt ggc caa gcg gac act egg 1488 
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Ala Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg 
485 490 495 

gag cgt aaa att gaa att tgt acc aat gcc tac aat att tta gtt gat 1536 
Glu Arg Lys He Glu He Cye Thr Asn Ala Tyr Asn He Leu Val Asp 
500 505 510 

gaa gtt ggc ttc cca cct gaa gat att att ttt gac cct aat att ttt 1584 
Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe 
515 520 525 

gcg gtt get aca ggt ate gat gaa cat aat aac tat gca gta gac ttt 1632 
Ala Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Val Asp Phe 
530 535 540 

att gaa gcc gtt ggt gat ata aag cga acg ctt cct cat gca atg att 1680 
He Glu Ala Val Gly Asp He Lys Arg Thr Leu Pro His Ala Met He 
5 *5 550 555 560 

tea ggt ggt gtt tct aac gtc tct ttt tct ttc cgt gga aat aac tac 1728 
Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr 
565 570 575 

gtt cgt gaa get ate cat gcc gta ttt tta.tat. cac_tgt ttt aaa aat 1776 
Val Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys Asn 
580 585 590 

ggt atg gat atg ggc ate gta aat gcg ggg cag ctg gaa ata tat gat 1824 
Gly Met Asp Met Gly lie Val Asn Ala Gly Gin Leu Glu He Tyr Asp 
595 600 605 

aac gta cca gaa gat ctg cgt gaa gcg gtt gaa gat gtg gta ttg aat 1872 
Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn 
610 615 620 

cgt cga gat gat tct acg gag cgt tta ctt gat att gca act gag tat 1920 
Arg Arg Asp Asp Ser Thr Glu Arg Leu Leu Asp He Ala Thr Glu Tyr 
"5 630 635 640 

tta gaa cga get gtt ggt aaa gtt gaa gat aaa tct get tta gag tgg 1968 
Leu Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp 
645 650 655 

cgt gac tgg cct gtt gaa aaa cgt ctt gag cat tct eta gtg aag ggg 2016 
Arg Asp Trp Pro Val Glu Lys Arg Leu Glu HiB Ser Leu Val Lys Gly 
660 665 670 

ata aca gag ttt att gtc gaa gat aca gaa gaa gca cga ate aat gca 2064 
lie Thr Glu Phe He Val Glu Asp Thr Glu Glu Ala Arg He Asn Ala 
675 680 685 

gaa aga cca ata gag gta att gaa ggg cca ttg atg gac gga atg aac 2112 
Glu Arg Pro lie Glu Val lie Glu Gly Pro Leu Met Asp Gly Met Asn 
690 695 700 

gtc gtt ggt gat ctt ttt ggg gaa gga aaa atg ttc ctt ccc caa gta 2160 
Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val 
705 710 715 720 



gta aag tct get cgt gta atg aaa caa get gtt get cat tta gaa ccg 
Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu Glu Pro 
725 730 735 



2208 
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ttt att aat gcg tct aaa gaa gtt gga gca aca aac ggt aaa ata ctt 2256 
Phe He Asn Ala Ser Lys Glu Val Gly Ala Thr Asn Gly Lys He Leu 
740 745 750, 

tta gca aca gta aaa ggt gat gtt cat gat att ggt aag aat ate gtt 2304 
Leu Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn He Val 
755 760 765 

ggc gtg gtt tta cag tgt aat aac tat gaa ata att gat ctt ggt gtc 2352 
Gly Val Val Leu Gin Cys Asn Asn Tyr Glu He He Asp Leu Gly Val 
770 775 780 

atg gtc tct tgt gaa act ate tta aaa gta gec aaa gaa gaa aat gta 2400 
Met Val Ser Cys Glu Thr He Leu Lys Val Ala Lys Glu Glu Asn Val 
78S 790 795 800 

gac ate att ggt tta tct gga tta ata aca cca tea tta gat gaa atg 2448 
Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
805 810 815 

gtc cat gtt get aaa gag atg gaa cga caa ggg ttt gat tta cca ttg 2496 
Val His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Asp Leu Pro Leu 
820 825 830 

ttg att ggt gga gca aca act tea aaa gca cat aca gcg gta aaa att 2544 
Leu He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 840 845 

gaa caa aac tat tct caa cct gtt gtg tac gtt aat aat get tct cga 2592 
Glu Gin Asn Tyr Ser Gin Pro Val Val Tyr Val Asn Asn Ala Ser Arg 
850 855 860 

get gta ggt gta tgt act tea tta ctt tea aat gaa eta aaa cct tct 2640 
Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lys Pro Ser 
865 870 875 880 

ttt gtt gag aag eta gat att gat tac gaa cgt gtt aga gag cag cat 2688 
Phe Val Glu Lys Leu Asp He Asp Tyr Glu Arg Val Arg Glu Gin His 
885 890 895 

agt cgt aaa caa ccg cga act aag cct gtg act tta gag gtt get cga 2736 
Ser Arg Lys Gin Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg 
900 90S 910 

gcg aat aaa gtc get att gac tgg get tct tat aca cct cct gtc cca 2784 
Ala Asn Lys Val Ala He Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro 
915 920 925 

eta aag cct ggt gta cat ata ttt gat aac ttt gat gtt tea aca ttg 2832 
Leu Lys Pro Gly Val His He Phe Asp Asn Phe Asp Val Ser Thr Leu 
930 93S 940 

cgt aat tat att gat tgg ace cca ttt ttt atg acg tgg tct ctt gtt 2880 
Arg Asn Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val 
945 950 955 960 

gga aaa tac ccg aag ate tta gag cat gaa gaa gtt ggt gaa gaa gee 2928 
Gly Lys Tyr Pro Lys He Leu Glu His Glu Glu Val Gly Glu Glu Ala 
965 970 975 

aaa cga tta ttt aaa gat gca aat gat eta tta gat cga gtt gaa aaa 2976 
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Lys Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lys 
980 985 990 

gaa ggg tta ctt aaa gcc cgt gga atg tgt gcg eta ttt cca get tec 
Glu Gly Leu Leu Lys Ala Arg Gly Met Cys Ala Leu Phe Pro Ala Ser 
595 1000 1005 

agt gtt ggt gat gat att gaa gta tat act gat gaa tea cgc act aca 
Ser Val Gly Asp Asp He Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr 
1010 1015 1020 



3024 



3072 



v=i f I ? f" " t 3at tt9 ° 9a caa caa ac9 9 a 9 aa 9 «9 aaa 3120 

Val Ala Lye Val Leu His Asn Leu Arg Gin Gin Thr Glu Lys Pro Lys 

1025 1030 loss iMO 

IT IV aat I 8t t9t " a tCt 9at tat ata gca ccc aaa gag teg ggt 3168 
Gly Phe Asn Tyr Cys Leu Ser Asp Tyr lie Ala Pro Lys Glu Ser Gly 
1° 45 1050 1055 

aaa aat gat tgg ate ggt ggt ttt get gta act ggt ggt att ggt gag 3216 
Lys Asn Asp Trp He Gly Gly Phe Ala Val Thr Gly Gly He Gly Glu 
1060 1065 1070 

cgt gaa eta get gat gaa tat aaa -gca -aat ggt gat gat tat aac get 3264 
Arg Glu Leu Ala Asp Glu Tyr Lys Ala Asn Gly Asp Asp Tyr Asn Ala 
1075 1080 1085 

t*? !?' °f 3 9 ? 9 9t9 9Ct 9at C9t cta 9Ct gaa get ttt get gaa 3312 
lie Met lie Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1°90 1095 iioo 

tat tta cat gaa aaa gta cgt aag gaa att tgg ggt tac tct cct aat 3360 
Tyr Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ser Pro Asn 
1105 111° 1115 1120 

? 39 Jf 9 f" tCa aat 98t 93t tta atc 9 aa aaa tac <*a 99C att 3408 
Glu Thr Leu Ser Asn Asp Asp Leu He Arg Glu Lys Tyr Gin Gly He 

ll 25 1130 H35 

cgt cct get cct ggt tac cca get tgt cct gaa cat aca gaa aaa ggg 3456 
Arg Pro Ala Pro Gly Tyr Pro Ala Cye Pro Glu His Thr Glu Lys Gly 
11*0 U45 iiso 7 

get tta tgg gag tta atg aat gtt gaa gaa tct att gga atg tct tta 3504 
Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser He Gly Met Ser Leu 
1155 H60 ii 6 5 



aca tea age tat gca atg tgg ccc ggt gca tct gtg tea gga atg tat 
Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr 
H" 70 1175 n8o 

ttt tea cac cca gat tct cgt tat ttt gcg att get cag att cag caa 
Phe Ser His Pro Asp Ser Arg Tyr Phe Ala He Ala Gin He Gin Gin 
1185 ll 90 1195 1200 

gat caa gcc gaa age tat gcc gat cgt aaa ggt tgg aat atg ctt gaa 
Asp Gin Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu Glu 
1205 1210 12 i5 

get gag aag tgg tta ggt cca aat ttg aat taa 
Ala Glu LyB Trp Leu Gly Pro Asn Leu Asn 
1220 1225 



3552 



3600 



3648 



3681 
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<210> 46 
<211> 1226 
<212> PRT 

<213> Vibrio fisheri 
<400> 46 

Val Ala Gly Ser Asn He Lye Val Gin He Glu Lys Gin Leu Ser Glu 
1 5 10 , 15 

Arg He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Gly Tyr 
20 25 30 

Lys Phe Glu Glu Lys Asp Tyr Arg Gly Gly Arg Phe Asn Gin Trp His 
35 40 45 

Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gin Pro Gin 
50 55 60 

He He Arg Asp He His Glu Ala Tyr Leu Glu Ala Gly Ala Asp He 
65 70 75 80 

Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr He Ala Met Ala Asp Tyr 
B5 90 95 

Asp Met Glu Ser Leu Ser Glu Glu He Asn Phe Glu Ala Ala Lys Leu 
100 105 no 

Ala Arg Glu Val Ala Asp Lys Trp Thr Glu Lys Thr Pro Asn Lye Pro 
115 120 125 

Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He 
130 135 140 

Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu 
145 150 155 160 

Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Arg Gly Gly 
165 170 175 

Ser Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly He Thr 
195 200 205 

Leu Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr 
210 215 220 

Leu Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val 
225 230 235 240 

Lys Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu 
245 250 255 

Arg Glu Tyr Val Ser Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser 
260 265 270 

Ala HiB Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu 
275 280 285 
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Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly 
290 295 300 

Phe Leu Aen Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He 
3 °5 310 315 320 

Arg Gin Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gin Leu Pro 
325 330 335 

Glu Leii Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr He 
340 345 350 

Glu Lys Asp Ser Leu Phe He Asn Val Gly Glu Arg Thr Asn Val Thr 
355 360 365 

Gly Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Leu Tyr Asp Glu 
370 375 380 

Ala Leu Ser Val Ala Gin Glu Gin Val Glu ABn Gly Ala Gin He He 
365 390 395 400 

Asp He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala CyB Met Val 
405 410 415 

Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro 
420 425 430 

Val Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
435 440 445 

Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lys Glu Gly 
450 455 460 

Lys Glu Lys Phe Val His Gin Ala Lys Leu He Arg Arg Tyr Gly Ala 
465 470 475 ~ 480 

Ala Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg 
485 490 495 

Glu Arg Lys He Glu He Cys Thr ABn Ala Tyr Asn He Leu Val Asp 
500 505 510 

Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe 
515 520 525 

Ala Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala val Asp Phe 
530 535 540 

He Glu Ala Val Gly Asp He Lys Arg Thr Leu Pro His Ala Met He 
545 550 555 560 

Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr 
565 570 575 

Val Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys Asn 
580 585 590 

Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp 
595 600 605 

Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn 
610 615 620 
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Arg Arg Asp Asp Ser Thr Glu Arg Leu Leu Asp He Ala Thr Glu Tyr 
625 630 635 640 

Leu ' Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp 
645 650 655 

Arg Asp Trp Pro Val Glu Lys Arg Leu Glu His Ser Leu Val Lys Gly 
660 665 670 

He Thr Glu Phe He Val Glu Asp Thr Glu Glu Ala Arg lie Asn Ala 
675 680 685 

Glu Arg Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn 
690 695 700 

Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val 
705 710 715 720 

Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala HiB Leu Glu Pro 
725 730 735 

Phe He Asn Ala Ser Lys Glu Val Gly Ala Thr Asn Gly Lys He Leu 

740 745 . . . _ 750 

Leu Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val 
755 760 765 

Gly Val Val Leu Gin Cys Asn Asn Tyr Glu He He Asp Leu Gly Val 
770 775 780 

Met Val Ser Cys Glu Thr He Leu Lys Val Ala Lys Glu Glu Asn Val 
785 790 795 800 

Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
80S 810 815 

Val His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Asp Leu Pro Leu 
820 825 830 

Leu He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 640 845 

Glu Gin Asn Tyr Ser Gin Pro Val Val Tyr Val Asn Asn Ala Ser Arg 
850 855 860 

Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lye Pro Ser 
865 870 875 880 

Phe Val Glu Lys Leu Asp He Asp Tyr Glu Arg Val Arg Glu Gin His 
885 890 895 

Ser Arg Lys Gin Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg 
900 905 910 

Ala Asn Lys Val Ala He Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro 
915 920 925 

Leu Lys Pro Gly Val His lie Phe Asp Ash Phe Asp Val Ser Thr Leu 
930 935 940 

Arg Asn Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val 
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945 950 955 960 

Gly Lye Tyr Pro Lye He Leu Glu His Glu Glu Val Gly Glu Glu Ala 
965 970 975 

Lys Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lys 
980 985 990 

Glu Gly Leu Leu Lys Ala Arg Gly Met Cys Ala Leu Phe Pro Ala Ser 
995 1000 1005 

Ser Val Gly Asp Asp He Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr 
1010 1015 1020 

Val Ala Lys Val Leu His Asn Leu Arg Gin Gin Thr Glu Lys Pro Lys 
1Q 25 1030 * 1035 1040 

Gly Phe Asn Tyr Cys Leu Ser Asp Tyr He Ala Pro Lys Glu Ser Gly 
1045 1050 " 1055 

Lys Asn Asp Trp He Gly Gly Phe Ala Val Thr Gly Gly He Gly Glu 
1060 1065 1070 

Arg Glu Leu Ala Asp Glu Tyr Lys Ala Asn Gly Asp Asp Tyr Asn Ala 
1075 1080 1085 

He Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1090 1095 iioo 

Tyr Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ser Pro Asn 
1105 IHO 1115 1120 

Glu Thr Leu Ser Asn Asp Asp Leu lie Arg Glu Lys Tyr Gin Gly He 
1125 H30 H35 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Gly 
H40 H45 1150 

Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser He Gly Met Ser Leu 
H55 H60 H65 

Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr 
H70 1175 HBO 

Phe Ser His Pro Asp Ser Arg Tyr Phe Ala He Ala Gin He Gin Gin 
1185 H90 H95 1200 

Asp Gin Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu Glu 
1205 1210 1215 

Ala Glu Lys Trp Leu Gly Pro Asn Leu Asn 
1220 1225 



<210> 47 
<211> 3780 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 

<222> (1) . . (3777) 
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<400> 47 

gtg ccc gtg ttt gac gac ctg ttt ggc cct gaa ggg gca aag-cgc gac 48 

Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp 
1 5 10 15 

ggc gcg gaa att ttc aag gcg ttg cgc gat gcc gcc age gaa cgc ate 96 
Gly Ala Glu lie Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg lie 
20 25 30 

etc att etc gat ggt gcc atg ggc acg cag ate cag ggt etc ggt ttt 144 
Leu lie Leu Asp Gly Ala Met Gly Thr Gin He Gin Gly Leu Gly Phe 
35 40 45 

gac gag gat cat ttt cgt ggc gac cgt ttt ate ggc tgc gcc tgt cac 192 
Asp Glu Asp His Phe Arg Gly Asp Arg Phe He Gly Cys Ala Cys His 
50 55 60 

cag aag ggc aat aac gac ctt ctg ate ctg aca cag ccc gat gcc ate 240 
Gin Lys Gly Asn Asn Asp Leu Leu He Leu Thr Gin Pro Asp Ala He 
65 70 75 80 

gag gaa ate cac tat cgc tac gcc atg gcg ggc gcg gat att etc gaa 268 
Glu Glu He His Tyr Arg Tyr Ala Met Ala Gly Ala Asp He Leu Glu 
85 90 95 

ace aac acg ttt tec tec ace cgc ate gcg cag gcc gat tac, gag atg 336 
Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Glu Met 
100 105 110 

gag aat gcc gtc tac gat etc aac cgc gag ggc gcg gcg ate gtg cgc 384 
Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala He Val Arg 
115 120 125 

egg gcg get cag cgc gcc gag cgc gag gat ggc cgc cgc cgt ttc gtg 432 
Arg Ala Ala Gin Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val 
130 135 140 

gcc ggt gcc ate ggt ccg acc aac cgc acg gcc teg ate teg cct gac 480 
Ala Gly Ala He Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
145 150 155 160 

gtc aac aat ccc ggt tac cgc gcc gtc agt ttc gac gat ctg cgc att 528 
Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg He 
165 170 175 

gcc tat ggc gag cag ate gat ggc ctg ate gac ggt ggt gcc gat ate 576 
Ala Tyr Gly Glu Gin He Asp Gly Leu lie Asp Gly Gly Ala Asp He 
180 185 190 

ate etc ate gag acg ate ttc gat acg ctg aac gcc aag gcg gcg ate 624 
He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
195 200 205 

ttc gcc tgc gag gaa cgt ttc gag get aag ggc ate cgc ctg ccg gtc 672 
Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly He Arg Leu Pro Val 
210 215 220 

atg ate tea ggc acg ate acc gac ctt tec ggt cgc acg ttg tec ggc 720 
Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
225 230 235 240 
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cag acg cct teg gcg ttc tgg aac teg gtg cgc cac gee aac ccc ttc 768 
Gin Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe 
245 250 255 

ace ate ggc etc aac tgc gcg etc ggt gcg gat gee atg cgc ccg cat 816 
Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His 
260 265 270 

ctg cag gaa ctg tec gat gtg gee gac acc ttt gtc tgc gee tat ccg 864 
Leu Gin Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro 
275 280 285 

aat gec ggc ctg ccg aac gag ttc ggc caa tat gac gaa acg ccc gag 912 
Asn Ala Gly Leu Pro Asn Glu Phe Gly Gin Tyr Asp Glu Thr Pro Glu 
290 295 300 

atg atg gcg cgc cag gtt gag ggc ttc gtt cgt gac ggt etc gtc aac 960 
Met Met Ala Arg Gin Val Glu Gly Phe Val Arg Asp Gly Leu Val Asn 
305 310 315 320 

ate gtc ggc ggt tgc tgc ggt teg acg ccg gaa cat ate egg gcg att 1008 
He Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His He Arg Ala He 
325 330 335 

gee gaa gec gtc aag gat tac aag ccc cgc gaa att cct gaa cac aag 1056 
Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu He Pro Glu His Lys 
340 345 350 

ccg ttc atg teg ctt tec ggc ctt gaa ccc ttc gtg ctg acc aag gac 1104 
Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp 
355 360 365 

att ccc ttc gtc aac gtg ggc gag cgc acc aac gtc acc ggt teg gec 1152 
He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
370 375 380 

cgc ttc cgc aag etc ate act gee ggc gac tat acg gcg gcg ctg get 1200 
Arg Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala 
385 390 395 400 

gtt gec cgc gac cag gtg gaa aac ggc gcg cag ate ate gac ate aac 1248 
Val Ala Arg Asp Gin Val Glu Asn Gly Ala Gin He He Asp He Asn 
405 410 415 

atg gat gag ggc ctg ate gat teg gaa aag gcg atg gtc gag ttc ctg 1296 
Met Asp Glu Gly Leu lie Asp Ser Glu Lys Ala Met Val Glu Phe Leu 
420 425 430 

aac etc ate gec gec gag cct gac att gec cgt gtg ccc gtc atg ate 1344 
Asn Leu He Ala Ala Glu Pro Asp lie Ala Arg Val Pro Val Met He 
435 440 445 

gac tea tec aag ttc gag ate ate gag gec ggc ctg aaa tgc gtg cag 1392 
Asp Ser Ser Lys Phe Glu He He Glu Ala Gly Leu Lys Cys Val Gin 
450 455 460 

ggc aaa teg ate gtc aat tec att teg ctg aag gaa ggc gag gag aag 1440 
Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu Lys 
465 470 475 480 

ttt etc cag cag get egg etc gtc cac aat tac ggt gcg gcg gtt gtc 14 88 
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Phe Leu Gin Gin Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val 
485 490 49S 

gtc atg gcc ttt gat gag gtc ggg cag gcg gat acc tat cag, cgc aag 1536 
Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Tyr Gin Arg Lys 
500 505 * 510 . 

gtg gaa ate tgc gcg cgc gcc tac aag ctt ctg acc gaa aag gcc ggt 1584 
Val Glu lie Cys Ala Arg Ala Tyr Lys Leu Leu Thr Glu Lys Ala Gly 
515 520 525 , 

ctg tct ccg gaa gac ate ate ttc gac ccg aat gtg ttt gcg gta get 1632 
Leu Ser Pro Glu Asp He He Phe Asp Pro Asn Val Phe Ala Val Ala 
530 535 540 

acg ggc ate gag gag cac aat aat tac ggc gtg gac ttc ate gag gcc 1680 
Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
545 550 555 560 

acc aag acc ate cgc gaa acc atg ccg etc acg cat att tec ggg ggc 1728 
Thr Lys Thr He Arg Glu Thr Met Pro Leu Thr His He Ser Gly Gly 
565 570 575 

gtt tec aac ctg tec ttc tec ttc cgc ggc aat gag ccg gtg cgt gag 1776 
Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
580 565 590 

gcg atg cat gcc gtg ttc etc tat cac gcc att cag gtc ggc atg gat 1824 
Ala Met His Ala Val Phe Leu Tyr His Ala He Gin Val Gly Met Asp 
595 600 605 

atg ggc ate gtc aac gcc ggg cag ctt gcg gtt tac gac aat ate gat 1872 
Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Asn He Asp 
610 615 620 

gcg gaa ctg cgc gag gcc tgc gaa gac gtg gtg ctg aac cgc cgc gac 1920 
Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp 
625 630 635 640 

gat gcc acg gag cgt ctg etc gag gtg gcg gag cgt ttc cgt ggt acg 1968 
Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr 
645 650 ~ 655 

ggt gaa aaa cag gcc aag gtg cag gat ctt tec tgg cgc gag tat ccc 2 016 
Gly Glu Lys Gin Ala Lys Val Gin Asp Leu Ser Trp Arg Glu Tyr Pro 
660 665 670 

gtt gaa aag egg ctg gaa cat get ctg gtc aac ggc att acc gac tat 2064 
Val Glu Lys Arg Leu Glu HiB Ala Leu Val Asn Gly He Thr Asp Tyr 
675 680 685 

ate gag gcc gat acg gaa gag gca cgc cag cag gcc gcc cgc ccg ctg 2112 
He Glu Ala Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro Leu 
690 695 700 

cat gtc ate gaa ggg ccg ctg atg gcc ggt atg aat gtg gtg ggt gac 2160 
His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
705 710 715 720 

ctg ttc ggt tec ggc aag atg ttc ctg cca cag gtg gtg aaa tec gcc 2208 
Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
725 730 735 
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cgt gtg atg aag cag gcg gtt gcc gtt ctg ctg cct tac atg gaa gag 2256 
Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu 
740 745 750 

gaa aag cgc ctg aat ggc ggt tec gag cgc agt gcc gcc ggc aag gtg 2304 
Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val 
755 760 765 

eta atg gcg acc gtg aag ggc gac gtg cac gat ate ggc aag aac ate 2352 
Leu Met Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn lie 
770 775 780 

gtc ggc gtt gtg eta gcc tgc aac aat tac gag ate att gat etc ggc 2400 
Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu Gly 
785 790 795 800 

gtg atg gtg ccg acg acg aaa ate etc gaa acg gcg ate gcc gaa aag 2448 
Val Met Val Pro Thr Thr Lys He Leu Glu Thr Ala He Ala Glu Lys 
805 810 815 

9tg gat gtg ate ggc etc tec ggc etc ate acc ccg teg ctg gat gag 2496 
Val Asp Val He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu 
820 825 830 

atg gtg cat gtg gcg gcc gaa atg gag cga cag ggt ttc gac att ccg 2544 
Met Val His Val Ala Ala Glu Met Glu Arg Gin Gly Phe Asp He Pro 
835 840 845 

ctg ctg ate ggc ggt gcg acg acc age cgt gtg cat acg gcg gta aaa 2592 
Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys 
850 855 860 

ate cat ccg cgt tac gag cag ggg cag gcg ate tat gtc acc gac gcc 2640 
He His Pro Arg Tyr Glu Gin Gly Gin Ala He Tyr Val Thr Asp Ala 
865 870 875 880 

teg cgc gcg gtg ggc gtc gtt tea gcg etc etc tec gaa gag cag aag 2688 
Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gin Lys 
885 890 895 

ccc get tat ate gac ggc ate cga gcc gaa tat gcc aag gtg gcg gaa 2736 
Pro Ala Tyr He Asp Gly He Arg Ala Glu Tyr Ala Lys Val Ala Glu 
900 905 910 



gcc cat gcc cgc aat gag cgc gaa aag cag cgc ctg ccg ctt tec cgc 
Ala His Ala Arg Asn Glu Arg Glu Lys Gin Arg Leu Pro Leu Ser Arg 
915 920 925 



2784 



gcc egg gag aat gcg cac aag ate gac tgg teg age tac age gtt gtc 2832 
Ala Arg Glu Asn Ala His Lys He ABp Trp Ser Ser Tyr Ser Val Val 
930 935 940 



aag ccg cag ttc ttc ggc acc aag gtt ttt gag acc tat gat ctg gaa 
Lys Pro Gin Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu 
945 950 955 



2880 



960 



gag ctt tec cgt tac ate gac tgg acg ccg ttc ttc cag acc tgg gaa 2928 
Glu Leu Ser Arg Tyr lie Asp Trp Thr Pro Phe Phe Gin Thr Trp Glu 
965 970 975 

ttg aag ggc cgt ttc ccg gcg ate ctt gaa gac gaa aag cag ggc gag 2976 
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Leu Lys Gly Arg Phe Pro Ala He Leu Glu Asp Glu Lys Gin Gly Glu 
9B0 985 990 



gcg.gcg egg cag ctt tat gec gat gcg cag gec atg ctt gcg aag ate 
Ala Ala Arg Gin Leu Tyr Ala Aflp Ala Gin Ala Met Leu Ala Lys He 
995 1000 1005 



3024 



ate gag gaa aag tgg ttc cga cca cgc gcg gtg ate ggc ttc tgg ccg 3072 
He Glu Glu Lys Trp Phe Arg Pro Arg Ala Val He Gly Phe Trp Pro 
1010 1015 1020 



gee aat gec gtg ggt gac gat ate agg etc ttt acg gat gaa ggt egg 
Ala Asn Ala Val Gly Asp Asp He Arg Leu Phe Thr Asp Glu Gly Arg 
1025 1030 1035 " 1040 



3120 



aag gaa gag ttg gcg acg ttc ttc acg ctg cgc cag cag ctt tec aag 
Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Leu Ser Lys 
1045 1050 1055 



3168 



cgc gat ggc cgt ccg aac gtg gcg ctg tec gat ttc gtc gcg ccc gtc 
Arg Asp Gly Arg Pro Asn Val Ala Leu Ser Asp Phe Val Ala Pro Val 
1060 1065 1070 



3216 



gat age ggc gtt gee -gat tat gtc ggc ggt ttc gtg gta acg gcg ggt 
Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly 
1075 1080 1085 



3264 



ate gag gaa gtg gcg att gec gag cgc ttc gag egg gec aat gac gat 
He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp 
1090 1095 1100 



3312 



tat teg tec ate etc gtc aag gcg ttg get gac cgt ttt gee gaa gec 
Tyr Ser Ser He Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala 
H05 1110 His 1120 



3360 



ttt gec gag cgt atg cat gag cgc gtg cgc aag gag ttc tgg ggt tat 
Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr 
1125 1130 1135 



3408 



gcg ccg gac gag get ctt gee ggt gac gat ctg ata ggc gaa gec tat 
Ala Pro Asp Glu Ala Leu Ala Gly Asp Asp Leu He Gly Glu Ala Tyr 
1140 1145 1150 



3456 



gee ggt ate cgc ccg gca ccg ggt tat ccg gee cag ccg gac cac ace 
Ala Gly lie Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr 
1155 H60 H65 



3504 



gaa aag aag acg ctg ttt get ctg ctg gac gee ace aat gcg gcg ggt 
Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly 
1170 1175 H80 



3552 



gtg gaa ttg acg gaa age tat gcg atg tgg ccc ggc teg teg gtt teg 
Val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser 
1185 1190 1195 1200 



3600 



ggc etc tat ate ggc cat ccc gaa age tat tat ttc ggc gtt gec aag 
Gly Leu Tyr He Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala Lys 
1205 1210 1215 



3648 



gtg gag egg gat cag gtt etc gac tat gcg cgc cgc aag gat atg ccg 
Val Glu Arg ABp Gin Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro 
1220 1225 1230 



3696 
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gtc aca gag gtg gag cgc tgg etc ggg ccg gtg etc aac tac gtg ccg 3744 
Val Thr Glu Val Glu Arg Trp Leu Gly Pro Val Leu Asn Tyr Val Pro 
1235 1240 1245 

acc aac ggc gag gag aaa ate gac age get gcg tga 3780 
Thr Asn Gly Glu Glu Lys lie Asp Ser Ala Ala 
1250 1255 



<210> 48 
<211> 1259 
<212> PRT 

<213> Agrobacterium tumefaciens 
<400> 46 

Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp 
1 5 10 15 

Gly Ala Glu lie Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg lie 
20 25 30 

Leu lie Leu Asp Gly Ala Met Gly Thr Gin lie Gin Gly Leu Gly Phe 
35 40 45 

Asp Glu Asp His Phe Arg Gly Asp Arg Phe lie Gly Cys Ala Cys His 
50 55 60 

Gin Lys Gly Asn Asn Asp Leu Leu lie Leu Thr Gin Pro Asp Ala He 
65 70 75 80 

Glu Glu He His Tyr Arg Tyr Ala Met Ala Gly Ala Asp He Leu Glu 
85 90 95 

Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Glu Met 
100 105 110 

Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala He Val Arg 
115 120 125 

Arg Ala Ala Gin Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val 
130 135 140 

Ala Gly Ala He Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
145 150 155 160 

Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg He 
165 170 175 

Ala Tyr Gly Glu Gin He Asp Gly Leu He ABp Gly Gly Ala Asp He 
180 185 190 

He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
195 200 205 

Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly He Arg Leu Pro Val 
210 215 220 

Met lie Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
225 230 235 240 

Gin Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe 
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214 



245 



250 



255 



Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His 
260 265 270 - 

Leu Gin Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro 
275 280 265 

Asn Ala Gly Leu Pro Asn Glu Phe Gly Gin Tyr Asp Glu Thr Pro Glu 
290 295 300 

Met Met Ala Arg Gin Val Glu Gly Phe Val Arg Asp Gly Leu Val Aen 
305 310 315 320 

He Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His He Arg Ala lie 
325 330 335 

Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu He Pro Glu His Lys 
340 345 350 

Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp 
355 360 365 

lie -Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
370 375 380 

Arg Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala 
385 390 395 400 

Val Ala Arg Asp Gin Val Glu Asn Gly Ala Gin He He Asp He Asn 
405 410 415 

Met Asp Glu Gly Leu He Asp Ser Glu Lys Ala Met Val Glu Phe Leu 
420 425 430 

Asn Leu He Ala Ala Glu Pro Asp He Ala Arg Val Pro Val Met He 
435 440 445 

Asp Ser Ser Lys Phe Glu He lie Glu Ala Gly Leu Lys Cys Val Gin 
450 455 460 

Gly Lys Ser lie Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu Lys 
465 470 475 480 

Phe Leu Gin Gin Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val 
485 490 495 

Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Tyr Gin Arg Lys 
500 505 510 

Val Glu He Cys Ala Arg Ala Tyr Lys Leu Leu Thr Glu Lys Ala Gly 
515 520 525 

Leu Ser Pro Glu Asp He lie Phe Asp Pro Asn Val Phe Ala Val Ala 
530 535 540 

Thr Gly lie Glu Glu His Asn Asn Tyr Gly Val Asp Phe lie Glu Ala 
545 550 555 560 

Thr Lys Thr lie Arg Glu Thr Met Pro Leu Thr His He Ser Gly Gly 



565 



570 



575 
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Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
580 585 590 

Ala Met His Ala Val Phe Leu Tyr His Ala He Gin Val Gly Met Aop 
595 600 605 

Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Asn He Asp 
610 615 620 

Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp 
625 630 635 640 

Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr 
645 650 655 

Gly Glu Lys Gin Ala Lys Val Gin Asp Leu Ser Trp Arg Glu Tyr Pro 
660 665 670 

Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Asp Tyr 
675 680 685 

He Glu Ala Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro Leu 
690 695 700 

His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
705 710 715 720 

Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
725 730 735 

Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu 
740 745 750 

Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val 
755 760 765 

Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He 
770 775 780 

Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu Gly 
785 790 795 800 

Val Met Val Pro Thr Thr Lys He Leu Glu Thr Ala He Ala Glu Lys 
805 810 815 

Val Asp Val He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu 
820 825 830 

Met Val His Val Ala Ala Glu Met Glu Arg Gin Gly Phe Asp He Pro 
835 840 845 

Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys 
850 855 860 

lie His Pro Arg Tyr Glu Gin Gly Gin Ala He Tyr Val Thr Asp Ala 
865 870 875 880 

Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gin Lys 
865 890 895 

Pro Ala Tyr lie Asp Gly lie Arg Ala Glu Tyr Ala Lys Val Ala Glu 
900 905 910 
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Ala His Ala Arg Asn Glu Arg Glu Lys Gin Arg Leu Pro Leu Ser Arg 
915 920 925 

Ala Arg Glu Asn Ala His Lye He Asp Trp Ser Ser Tyr Ser Val Val 
930 935 940 

Lys Pro Gin Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu 
945 950 955 * 960 

Glu Leu Ser Arg Tyr He Asp Trp Thr Pro Phe Phe Gin Thr Trp Glu 
965 970 975 

Leu Lye Gly Arg Phe Pro Ala He Leu Glu Asp Glu Lys Gin Gly Glu 
980 985 " 990 

Ala Ala Arg Gin Leu Tyr Ala Asp Ala Gin Ala Met Leu Ala Lys He 
995 1000 1005 

He Glu Glu Lys Trp Phe Arg Pro Arg Ala Val He Gly Phe Trp Pro 
1010 1015 1020 

Ala Asn Ala Val Gly Asp Asp He Arg Leu Phe Thr Asp Glu Gly Arg 
1025 _ „ 1030 . , 1035 1040 

Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Leu Ser Lys 
1045 1050 1055 

Arg Asp Gly Arg Pro Asn Val Ala Leu Ser Asp Phe Val Ala Pro Val 
1060 1065 1070 

Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly 
1075 1080 1085 

He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp 
1090 1095 1100 

Tyr Ser Ser He Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala 
1105 1110 1115 1120 

Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr 
1125 1130 1135 

Ala Pro Asp Glu Ala Leu Ala Gly Asp Asp Leu He Gly Glu Ala Tyr 
1140 1145 1150 

Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr 
1155 1160 1165 

Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly 
1170 1175 1180 

Val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser 
1185 1190 1195 1200 

Gly Leu Tyr He Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala Lys 
1205 1210 1215 

Val Glu Arg Asp Gin Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro 
1220 1225 1230 

Val Thr Glu Val Glu Arg Trp Leu Gly Pro Val Leu Asn Tyr Val Pro 
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1235 1240 1245 

Thr Asn Gly Glu Glu Lys He Asp Ser Ala Ala 
1250 1255 



<210> 49 
<211> 2718 
<212> DNA 

<213> Ralstonia solanacearum 

<220> 

<221> CDS 

<222> (1)..(2715) 

<223> RSOL_GMI1000 

<400> 49 

atg acc gac cac etc atg cgc etc tec ggc etc gaa ccg ttc aac ate 48 
Met Thr Asp Hie Leu Met Arg Leu Ser Gly Leu Glu Pro Phe Aan He 
1 5 10 15 

ggc gag gac acg ctg ttc gtc aac gtc ggc gaa cgc acc aac gtc acc 96 
Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr 
20 25 30 

gga tec aag gcg ttc gcg cgc atg ate etc aac age cag ttc gac gag 144 
Gly Ser Lys Ala Phe Ala Arg Met He Leu Asn Ser Gin Phe Asp Glu 
35 40 45 

gcg etc gee gtg gca cgc cag cag gtc gag aac ggc gcg cag gtc ate 192 
Ala Leu Ala Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He 
50 55 60 

gac ate aac atg gac gag gec atg etc gac tec aag gcg gcg atg gtg 240 
Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val 
65 70 75 80 

cgc ttc ctg aac ctg ate gee teg gag ccg gac ate gcg cgc gtg ccg 288 
Arg Phe Leu Asn Leu He Ala Ser Glu Pro Asp He Ala Arg Val . Pro 
85 90 95 

ate atg ate gac teg tec aag tgg gag gtg ate gag gee ggc ctg aag 336 
He Met He Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
100 105 110 

tgc gtg cag ggc aag gee ate gtc aac teg ate teg etc aag gaa ggc 384 
Cys Val Gin Gly Lys Ala He Val ABn Ser He Ser Leu Lys Glu Gly 
115 120 125 

gag gaa cag ttc gee cac cac gec aag ctg ate aag cgc tac ggc gee 432 
Glu Glu Gin Phe Ala His His Ala Lys Leu He Lys Arg Tyr Gly Ala 
130 135 140 

gee gee gtg gtg atg gee ttc gac gag cag ggc cag gec gac acg ttc 480 
Ala Ala Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Phe 
145 150 155 160 

gcg cgc aag acc gag ate tgc aag cgc age tat gac ttc etc gtg aac 528 
Ala Arg Lys Thr Glu He Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn 
165 170 175 



cag gtc ggc ttt gcg ccg gaa 



gac ate ate ttc gat ccg aac ate ttc 



576 
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Gin Val Gly Phe Ala Pro Glu Asp lie lie Phe Asp Pro Asn He Phe 
180 185 * 190 

gcg gtc gcc acc ggc ate gag gag cac aac aac tac gec gtc „ gac ttc 624 
Ala Val Ala Thr Gly He Glu Glu H1b Asn ABn Tyr Ala Val Asp Phe 
195 200 205 

ate gag gcc acg cgc tgg ate aag cag aaa ttg ccg cac gcc aag gtg 672 
He Glu Ala Thr Arg Trp He Lys Gin Lys Leu Pro His Ala Lys Val 
210 215 220 

age ggc ggc gtg teg aac gtc teg ttc teg ttc cgc ggc aac gac gtg 720 
Ser Gly Gly Val Ser ABn Val Ser Phe Ser Phe Arg Gly Asn Asp Val 
225 230 235 240 

gtg cgc gag gcc ate cac acc gtg ttc ctg tac cac gcc ate ggt gcg 768 
Val Arg Glu Ala He His Thr Val Phe Leu Tyr His Ala He Gly Ala 
245 250 255 

ggc atg gac atg ggc ate gtc aac gcg ggc cag ttg ggc gtg tac gag 816 
Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Gly Val Tyr Glu 
260 265 270 

aac etc gcc ccc gaa ctg cgc gag cgc gtg gaa gac gtg gtg etc aac 864 
Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Asp Val Val Leu Asn 
275 280 285 

cgc cgc ccg gat gcg acc gac cgc ctg ctg gaa att gcc gac cgc tac 912 
Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu He Ala Asp Arg Tyr 
290 295 300 

aag ggc ggc ggc gcc aag cgc gag gag aac etc gcc tgg cgc cag gag 960 
Lys Gly Gly Gly Ala Lys Arg Glu Glu Asn Leu Ala Trp Arg Gin Glu 
305 310 315 ' 320 

ccg gtg gaa aag cgc ctg gcc cac gcg etc gtg cac ggc ate acc gac 1008 
Pro Val Glu Lys Arg Leu Ala His Ala Leu Val His Gly He Thr Asp 
325 330 335 

tac gtg gtc gaa gac acc gag gaa gtt cgc cag aag ate ttt gcc gcc 1056 
Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gin Lys He Phe Ala Ala 
340 345 350 

ggc ggc cgc ccg ate cag gtg ate gag ggc ccg ctg atg gac ggc atg 1104 
Gly Gly Arg Pro He Gin Val He Glu Gly Pro Leu Met Asp Gly Met 
355 360 365 

aac ate gtc ggc gat ctg ttc ggc gcg ggc aag atg ttc ctg ccg cag 1152 
Asn He Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin 
370 375 380 

gtg gtg aaa tec gcc cgc gtg atg aag cag gcg gtg gcc cac ctg ate 1200 
Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He 
385 390 395 400 

ccg ttc ate gag gaa gag aag egg cag ate gcg gcc gcc ggc ggc gac 1248 
Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala Ala Gly Gly Asp 
405 410 415 

gtg cgc teg cgc ggc aag ate gtc ate gcc acc gtg aag ggc gac gtg 1296 
Val Arg Ser Arg Gly Lys He Val He Ala Thr Val Lys Gly Asp Val 
420 425 430 
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cac gac ate ggc aag aac ate gtc acc gtc gtg etc cag tgc aac aac 1344 
His Asp He Gly Lys Asn He Val Thr Val Val Leu Gin Cys Asn Aan 
435 440 445 

ttc gaa gtc gtg aac atg ggc gtg atg gtc ccg tgc aac gag ate ctg 1392 
Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu He Leu 
450 455 460 

gec aag gcg aag gtc gag ggc gcg gac ate ate ggc ctg teg ggc ctg 1440 
Ala Lys Ala Lys Val Glu Gly Ala Asp He He Gly Leu Ser Gly Leu 
465 470 475 480 

ate aca ccg teg ctg gaa gag atg gee tac gtg gee tec gag atg cag 1488 
He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gin 
485 490 495 

cgc gac gag tac ttc cgc gtg aag aag ate ccg ctg ctg ate ggt ggc 1536 
Arg Asp Glu Tyr Phe Arg Val Lys Lys He Pro Leu Leu He Gly Gly 
500 505 510 

gcg acc acg age cgc gtg cac acc gee gtg aag ate gcg ccc aat tac 1584 
Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He Ala Pro ABn Tyr 
515 520 525 

gaa ggc ccg gtc gtg tac gtg ccc gac gee teg cgc teg gtg age gtg 1632 
Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val 
530 535 540 

gee tec age ctg ctg tec gac gag gee gee gcg cgc tac ate gaa gag 1680 
Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr He Glu Glu 
545 550 555 ~ 560 

ctg cac gee gac tac gac cgc ate cgc acc cag cac gec age aag aaa 1728 
Leu His Ala Asp Tyr Asp Arg He Arg Thr Gin His Ala Ser Lys Lys 
565 570 575 

gec atg ccg atg gtg teg ctg gee gee gcg cgc gee aac aag acc egg 1776 
Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn LyB Thr Arg 
580 585 590 

ate gac tgg teg aac tac acg ccg ccc aag ccc aag ttc gtc ggc cgc 1824 
He Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg 
595 600 " 605 

cgc gtg ttc cgc aac tac gac ctg aac gag etc gcg cag tac ate gac 1872 
Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gin Tyr He Asp 
610 615 620 

tgg ggc ccg ttc ttc cag acg tgg gac ctg gee ggc aaa ttc ccc gac 1920 
Trp Gly Pro Phe Phe Gin Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp 
625 630 635 640 

ate etc aac gac gcg ate gtc ggc gaa teg gec cgc cgc gtg ttc tec 1968 
He Leu Asn Asp Ala He Val Gly Glu Ser Ala Arg Arg Val Phe Ser 
645 650 655 

gac ggc aag age atg etc gcg cgc ctg ate gee gga cgc tgg ctg acg 2016 
Asp Gly Lys Ser Met Leu Ala Arg Leu He Ala Gly Arg Trp Leu Thr 
660 665 670 

gec aac ggc gtg ate gcg ctg ctg ccg gee aac acc gtc aac gac gac 2 064 
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Ala Asn Gly Val He Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asp 
675 680 685 

gac ,atc gag ate tac acc gac gag acc cgc teg gaa gtc gee etc ace 2112 
Asp He Glu He Tyr Thr Asp Glu Thr Arg Ser Glu Val Ala Leu Thr 
690 695 700 

tgg cgc aac ate cgc cag cag age gag cgc ccg ate ate gac ggc gtg 2160 
Trp Arg Asn He Arg Gin Gin Ser Glu Arg Pro He He Asp Gly Val 
705 710 715 720 

atg cgc ccg aac cgc tgc ctg gcg gac ttc ate gee ccc aag gac acc 2208 
Met Arg Pro Asn Arg Cys Leu Ala Asp Phe He Ala Pro Lys Asp Thr 
725 730 735 

ggc ate gee gat tac ate ggc etc ttc gcg gtg acg ggc ggc ate ggg 2256 
Gly He Ala Asp Tyr He Gly Leu Phe Ala Val Thr Gly Gly He Gly 
740 745 750 

ate gac aag cgc gaa gee gec ttc gaa gee gac cac gac gac tac age 2304 
He Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser 
755 760 765 

gcg ate atg etc aag gee ctg gee gac cgc ttc gee gaa gee ttc gee 2352 
Ala He Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala 
770 775 780 

gag tgc ctg cac gee cgt gtg cgc cgc gac ctg tgg ggc tac gcg cag 2400 
Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gin 
785 790 795 800 

gac gaa acg etc gac aac gac gcg ctg ate cgc gag gaa tac cgc ggc 2448 
Asp Glu Thr Leu Asp Asn Asp Ala Leu He Arg Glu Glu Tyr Arg Gly 
805 810 * 815 

ate cgc ccg gcg ccc ggc tac ccg gee tgc ccg gag cac acc gtc aag 24 96 
He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lys 
820 825 830 

cgc gac ctg ttc cgc gtg etc gac gcg cag gag ate ggc atg aac ctg 2544 
Arg Asp Leu Phe Arg Val Leu Asp Ala Gin Glu He Gly Met Asn Leu 
835 840 845 

acc gag gcg ctg gcg atg aca ccg gee gcg teg gtc teg ggc ttc cag 2592 
Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gin 
850 855 860 



ctg teg cac ccg gac age acg tac ttc 
Leu Ser His Pro Asp Ser Thr Tyr Phe 
865 870 

gac cag gtg gac gac atg gee gcg cgc 
Asp Gin Val Asp Asp Met Ala Ala Arg 
885 

gtg gag cgc gee ctg gca ccc aac ctg 
Val Glu Arg Ala Leu Ala Pro Asn Leu 
900 905 



acg ate ggc aag ate ggc cag 2640 

Thr He Gly Lys He Gly Gin 

875 880 

age ggg gaa gac cgc cgc aat 2688 

Ser Gly Glu Asp Arg Arg Asn 
890 895 

taa 2718 



<210> 50 
<211> 905 
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<212> PRT 

<213> Ralstonia solanacearum 
<400> 50 

Met Thr Asp His Leu Met Arg Leu Ser Gly Leu Glu Pro Phe Asn lie 
1 S 10 15 

Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr 
20 25 30 

Gly Ser Lys Ala Phe Ala Arg Met lie Leu Asn Ser Gin Phe Asp Glu 
35 40 45 

Ala Leu Ala Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He 
50 55 60 

Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val 
65 70 75 80 

Arg Phe Leu Asn Leu He Ala Ser Glu Pro Asp He Ala Arg Val Pro 
85 90 95 

He Met He Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
100 105 HO 

Cys Val Gin Gly Lys Ala He Val Asn Ser He Ser Leu Lys Glu Gly 
115 120 125 

Glu Glu Gin Phe Ala His His Ala Lys Leu He Lys Arg Tyr Gly Ala 
130 135 140 

Ala Ala Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Phe 
145 150 155 160 

Ala Arg Lys Thr Glu He Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn 
165 170 175 

Gin Val Gly Phe Ala Pro Glu Asp He He Phe Asp Pro Asn He Phe 
180 185 190 

Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp Phe 
195 200 205 

He Glu Ala Thr Arg Trp He Lys Gin Lys Leu Pro HiB Ala Lys Val 
210 215 220 

Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Val 
225 230 235 ~ * 240 

Val Arg Glu Ala He His Thr Val Phe Leu Tyr His Ala He Gly Ala 
245 250 255 

Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Gly Val Tyr Glu 
260 265 270 

Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Asp Val Val Leu Asn 
275 280 285 

Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu He Ala Asp Arg Tyr 
290 295 300 

Lys Gly Gly Gly Ala Lys Arg Glu Glu Asn Leu Ala Trp Arg Gin Glu 
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305 



310 



315 



320 



Pro Val Qlu Lys Arg Leu Ala His Ala Leu Val His Gly He Thr Asp 
3 25 330 ,335 

Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gin Lys He Phe Ala Ala 
340 345 350 

Gly Gly Arg Pro He Gin Val He Glu Gly Pro Leu Met Asp Gly Met 
355 360 365 

Asn He Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin 
370 375 380 

Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He 
385 390 395 400 

Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala Ala Gly Gly Asp 
405 410 415 

Val Arg Ser Arg Gly Lys He Val lie Ala Thr Val Lys Gly Asp Val 
420 425 430 

His Asp He Gly Lys Asn He Val Thr Val Val Leu Gin Cys Asn Asn 
435 440 445 

Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu lie Leu 
450 455 460 

Ala Lys Ala Lys Val Glu Gly Ala Asp lie lie Gly Leu Ser Gly Leu 
465 470 475 480 

He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gin 
485 490 495 

Arg Asp Glu Tyr Phe Arg Val Lys Lys lie Pro Leu Leu He Gly Gly 
500 505 510 

Ala Thr Thr Ser Arg Val His Thr Ala Val Lys lie Ala Pro Asn Tyr 
515 520 525 

Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val 
530 535 540 

Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr lie Glu Glu 
545 550 555 560 

Leu His Ala Asp Tyr Asp Arg lie Arg Thr Gin His Ala Ser Lys Lys 
565 570 575 

Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn Lys Thr Arg 
580 585 " 590 

He Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg 
595 600 605 

Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gin Tyr He Asp 
610 615 620 

Trp Gly Pro Phe Phe Gin Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp 



625 



630 



635 



640 
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He Leu Asn Asp Ala He Val Gly Glu Ser Ala Arg Arg Val Phe Ser 
645 650 655 

Asp Gly Lys Ser Met Leu Ala Arg Leu He Ala Gly Arg Trp Leu Thr 
660 665 670 

Ala Asn Gly Val He Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asp 
675 680 685 

Asp He Glu He Tyr Thr Asp Glu Thr Arg Ser Glu Val Ala Leu Thr 
690 695 700 

Trp Arg Asn He Arg Gin Gin Ser Glu Arg Pro He He Asp Gly Val 
705 710 715 720 

Met Arg Pro Asn Arg Cys Leu Ala Asp Phe He Ala Pro Lys Asp Thr 
725 730 735 

Gly He Ala Asp Tyr He Gly Leu Phe Ala Val Thr Gly Gly He Gly 
740 745 750 

He Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser 
755 760 765 

Ala He Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala 
770 775 780 

Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gin 
785 790 795 800 

Asp Glu Thr Leu Asp Asn Asp Ala Leu He Arg Glu Glu Tyr Arg Gly 
805 810 815 

He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lys 
820 825 830 

Arg Asp Leu Phe Arg Val Leu Asp Ala Gin Glu He Gly Met Asn Leu 
835 840 845 

Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gin 
850 855 860 

Leu Ser His Pro Asp Ser Thr Tyr Phe Thr He Gly Lys He Gly Gin 
86S 870 875 880 

Asp Gin Val Asp Asp Met Ala Ala Arg Ser Gly Glu Ab P Arg Arg Asn 
885 890 895 

Val Glu Arg Ala Leu Ala Pro Asn Leu 
900 9Q5 



<210> 51 
<211> 3645 
<212> DNA 

<213> Chlorobium tepidum 

<220> 
<221> CDS 
<222> (1) . . (3642) 
<223> RCL00420 
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<400> 51 

gtg etc gac ggg gec atg ggc acc atg ate cag agg cat ggc etc gac 48 
Val Leu Aep Gly Ala Met Gly Thr Met lie Gin Arg His Gly Leu Asp 

V 5 10 . ■ , 15 

gaa cag gac tac egg ggc gag cgt ttc get teg cat gac cat ccg ctg 96 
Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

aag ggc aac aac gac ctt ctt gtc ate acc egg ccc gac a%e ate cgt 144 
Lys Gly Asn Asn Asp Leu Leu Val lie Thr Arg Pro Asp lie lie Arg 
35 40 45 

teg ate cac tgc gac ttc etc gac gcg ggt gcg gac ate ate gag acc 192 
Ser lie His Cys Asp Phe Leu Asp Ala Gly Ala Asp lie He Glu Thr 
50 55 60 

tgc acc ttc aac gee aac ccg ate teg cag teg gac tac cag ttg cag 240 
Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gee cgc teg 288 
Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg 336 
Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 * 110 

gee ggt tec ate gga ccg acc aac aag acg etc teg etc teg ccg gac 384 
Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 X25 

gtg aac aac ccc ggc ttc cgc gee gtc acc ttc cag gag atg gtc gat 432 
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

aac tac act gee cag etc gaa ggc ttg cac gag ggc ggt gtc gat etc 480 
Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

ttg etc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg etc 528 
Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

tac get ate gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg 576 
Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 ' * 190 

atg gtc tec ggc acg gtg gtg gac gcg age ggc cgc acc etc tec ggc 624 
Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

caa acc acc gag gcg ttc tgg att teg att teg cac atg ccg agt ctg 672 
Gin Thr Thr Glu Ala Phe Trp He Ser lie Ser His Met Pro Ser Leu 
210 215 220 

etc teg gtc ggc ctg aac tgc gca etc ggc tec aag cag atg cgc ccc 720 
Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

ttc ate gag gcg etc teg aac ate gee gaa age tac gtc age gtc tat 768 
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Phe He Qlu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac. gac gac tec ccc 816 
Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

gag tac atg gec gcg cag ate gcg ggc ttc gec gaa tea ggc ttc gtg 864 
Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

aac ate gtc ggc ggc tgc tgc ggc acc acg ccg acg cac ate cgc gee 912 
Asn He Val Gly Gly Cys Cys Gly Tnr Thr Pro Thr His He Arg Ala 
290 295 300 

att gee gaa gcg gtc aag act etc ccg ccg aga aag cgc ccc gee aac 960 
He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 320 

aag cac gtg ctg agg etc tec ggc etc gaa ccg etc gtg gtt gac gaa 1008 
Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

acc acc ggc ttc ate aac gtc ggc gag cgc acc aac gtc acc ggt teg 1056 
Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 

cgc aag ttc gee cgc etc ate aag gag gec aat tac gac gaa gcg etc 1104 
Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

tec att gec cgc cag cag gtc gag aac ggc gcg cag gtg ate gac gtg 1152 
Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

aac etc gac gaa gga atg etc gac tec gaa aag gtg ate gtc gaa ttc 1200 
Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val He Val Glu Phe 
385 390 395 400 

ctg aac etc ate gec tec gag cct gag ate gec aag gtg ccg gtg atg 124 8 
Leu Asn Leu He Ala Ser Glu Pro Glu He Ala Lys Val Pro Val Met 
405 410 ^ 415 

ate gac teg teg aaa tgg teg gtc ate gaa aac ggc ctg cgc tgc acc 1296 
He Asp Ser Ser Lys Trp Ser Val He Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

cag ggc aag age ate gtc aac teg ate age etc aag gag ggc gag gag 1344 
Gin Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu 
435 440 445 

ctg ttc aag gag cgc get cgc aag ate atg caa tac ggc gcg gcg gcg 1392 
Leu Phe Lys Glu Arg Ala Arg Lys He Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

gtg gtc atg gee ttc gac gag cag ggc cag gee gac age ctg cac cgc 1440 
Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu His Arg 
465 470 475 480 



cgc ate gag att tgc age cgc gee tac aaa att etc acc gaa gag gtg 
Arg lie Glu He Cys Ser Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
485 490 495 



1488 
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ggc ttc ccg ccg gag gac ate ate ttt gac ccg aac gtg ctg ace gtg 1536 
Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn Val Leu Thr Val 
500 505 510 < 

gec ace ggc ate gac gag cac aac aac tac gcg etc gac ttc ate gaa 1584 
Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

age gtg. cgc tgg ate aag cag aac ctg ccg cac gcg aag gtc tec ggc 1632 
Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

ggc ate age aac gtt teg ttc tec ttc cgc ggc aac gag ccg gtg cgc 1680 
Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

gag gcg atg cac ace gcg ttc etc tac cac gee ate cac gee ggt etc 1728 
Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

gac atg ggc ate gtc aac gee gee cag ctt ggc ate tac gaa gag ate 1776 
Asp Met Gly He Val Asn Ala Ala Gin Leu Gly lie Tyr Glu Glu He 
560 585 590 

gac ccg gag ctt ctt gtc tat gtc gag gac gtg ctg ctg aac cgc cgc 1824 
Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 605 

gac gac gec acc gag egg etc gtg gcg ttc get gaa acg ate cgc gac 1872 
Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
610 615 620 

ggc ggc gaa aag gee gag gee aag aac gee gaa tgg cgc aac gee ccg 1920 
Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
625 630 635 640 

gtc gag gag egg ctg aaa cac gcg etc gtc aag ggc ate gtt gac tac 1968 
Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

ate gac gag gac acc gaa gag gee cgc cag etc tac ccg agt ccg ctg 2016 
He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

gag gtg ate gag ggg ccg etc atg aac ggc atg aac cac gtc ggc gac 2064 
Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

etc ttc gec gaa ggc aag atg ttc ctg cca cag gtg gtc aaa age gec 2112 
Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

cgc gtc atg aag cgc teg gta get gcg ctg att ccc tat ate gag gag 2160 
Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

gag aag teg aaa aac tgc gac acg age gee aaa gec aag gtg ctg etc 2208 
Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

gec acg gtg aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg 2256 
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Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys ABn lie Val Ser 
740 745 750 

gtg gtg ctt gcc tgc aac aac ttc gac gtg ate gac ate ggc gtc atg 2304 
Val Val Leu Ala Cys Asn Asn Phe Asp Val lie Asp He Gly Val Met 
755 760 765 

atg cca tgc gac aag att etc gaa gcg ctg gca gaa cac aag ccc gac 2352 
Met Pro Cys Asp Lys lie Leu Glu Ala Leu Ala Glu His Lys Pro Asp 
770 775 780 



gtg etc ggc etc tec ggc etc ate acc ccg teg etc gaa gag atg gcg 
Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 795 



800 



2400 



cac gtg gcc aaa gag atg gag egg etc ggc atg aac att ccg etc ate 2448 
His Val Ala LyB Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

ate ggc ggc gcg acc acc teg aag gtg cac acg gcg gtg aaa etc gcg 2496 
He Gly Gly Ala Thr Thr Ser LyB Val His Thr Ala Val Lys Leu Ala 
820 825 830 

ccc tgc tac ccc age ggc gcg gta~gta- cac gtg etc gac gcc teg cgc 2544 
Pro Cye Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 640 845 

age gtg ccg gtg gtc age aac etc tgc aac ccc gcc cag cgc gac age 2592 
Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 

tat ate gcg gcg ctg aag gat gag cag gag gcg atg cgc aag age cac 2640 
Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
865 870 875 * 880 

gcc gag cgc atg gcg gca aaa aag tac gtc teg etc gac gcc gcc cgc 2688 
Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

gac aac cgc etc acc att gac tgg gag gcc gaa acc ate gac aag ccc 2736 
Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 90S 910 

gcc cag act ggc gtc acc gtg ctg gag gat gtc acc gtc ggc gcg etc 2784 
Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

cgc ccg tat ate gac tgg gca mcc ttc ttc tgg age tgg gag ctg cac 2832 
Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 

ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca 2880 
Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 

acc aaa etc ttc aac gac gcc acc get ctg etc gac egg ate gac age 2928 
Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 975 

gaa aag ctg etc ggc ate aaa ggc gtg gcg ggc ate ttc ccg gcc aac 2976 
Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 
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age ate ggc gac gac ate ttc gtc tat gcg gat gac gag cqc tea ata ™sa 
Ser lie Gly Asp Asp lie Phe Val Tyr Ala Lp Lp H Sg 2 £ 3024 
995 i°oo 100S 

?i C l 9C ^° 9t ? ° t9 Cac acc cts cgc "9 caa 9S C 9aa aag cac ggc 3072 
lie Arg Thr Val Leu His Thr Leu Arg Gin Gin lly Glu Ly? His 111 

1010 1015 1020 

Pi! f? 9 aa ° f tC 9C9 Ct9 909 9ac ttc 9tg gec ccg cgc gaa age ggc 3120 
Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro A?g Slu sir III 

1030 1035 ioio 

gtc aac gac tgg ate ggc tgc ttc acc gta acc gec gga etc ggc ate 3isa 
Val Asn Asp Trp He Gly Cye Phe Thr Val Thr La Sly III 111 ill ° 
104S 1050 loss 

cag aat ttg etc gac gag ttc aca gca gag aac gac gac tac cac cgc 32lfi 
Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Lp Lp Tyr Ss Sg 
1060 1065 1070 

III Me? a? 9 ??° 9aC C9a Ct9 9CC 9aa 9*9 ttc 9« 9ag 3264 

lie Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 

1075 10BO io85 

atg ctg cac gaa aag gtg cgc cgc gaa etc tgg ggc tac gcg ccc aac 331? 

Nioo Hi8 ° 1U LyS VSl Glu Leu T S «y Tyr 5! Ill 111 

1090 1095 1100 

gaa ate etc ggc aac gaa gag ctg ate gee gaa aag tac cga ggc ate 3360 
?5s 7 ?uo GlU ^ Ma ?u S Lya 1/1 ** 9 " y " a 

cgc ccc gee ccc ggc tac ccc gee tgc ccg gat cac acc gaa aag gca 3408 
Arg Pro Ala Pro Gly Tyr Pro Ala Cye Pro Asp His Thr Glu Ly! Ma 
112 $ 1130 

lit Til til ? a ° f t9 ° tC 330 9Ct 9M 9C9 gcc acc 99C gtc acg ctg 3456 
He lie Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 

1140 "45 i 150 

acg gaa act ttc gcg atg aac ccc gca gcc tea gtc tgc ggc etc tac 3504 
Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Sly III tJt 
115S ii6o n 65 ' 

ttc gcc aac ccg gcc teg aaa tac ttc gta etc ggc aag att ggt aaa 3552 
Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Ly! III Jg lyl 
1170 11? 5 1180 

gat cag gtc gaa gac tac gcc aac cgc aaa qqq cto aaa ota a n* « aa 

Asp Gin Val Glu Asp Tyr La Asn Arg Lys 1?? 2 £ va? L" K 3600 

1185 1190 1195 1200 

gcc gag aag tgg etc gcg ccc teg ctg aac tac gat cca gcg 364 2 
Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 



taa 



3645 



<210> 52 
<211> 1214 
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<212> PRT 

<213> Chlorobium tepidum 
<220> 

<221> unsure 
<222> 936 ..936 

<223> All occurrences of Xaa indicate any amino acid 
<400> 52 

Val Leu Asp Gly Ala Met Gly Thr Met lie Gin Arg His Gly Leu Asp 
1 5 10 15 

Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

Lys Gly Asn Asn Asp Leu Leu Val lie Thr Arg Pro Asp He He Arg 
35 40 45 

Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp lie He Glu Thr 
50 55 60 

Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
« 70 75 * 80 

Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 HO 

Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 * 190 

Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Leu 
210 215 220 

Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 * 240 

Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 
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Asn lie Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His lie Arg Ala 
290 295 300 

lie Ala Glu Ala Val Lys Thr Leu Pro Pro Arg LyB Arg Pro Ala Asn 
305 310 315 • 320 

Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Aap Glu 
325 330 335 

Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 34S 350 

Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

Asn Leu Asp Glu Gly Net Leu Asp Ser Glu Lys Val He Val Glu Phe 
385 390 395 400 

Leu Asn Leu He Ala Ser Glu Pro Glu He Ala Lys Val Pro Val Met 
405 410 415 

He Asp Ser Ser Lys Trp Ser Val He Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

Gin Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu 
435 440 445 

Leu Phe Lys Glu Arg Ala Arg Lys He Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu His Arg 
465 470 475 480 

Arg He Glu He Cys Ser Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
485 490 495 

Gly Phe Pro Pro Glu Asp lie lie Phe Asp Pro Asn Val Leu Thr Val 
500 505 510 

Ala Thr Gly lie Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

Asp Met Gly lie Val Asn Ala Ala Gin Leu Gly He Tyr Glu Glu lie 
580 585 590 

Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 605 

Asp Aap Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
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610 615 620 

Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
G25 630 635 640 

Val Glu Glu Arg Leu Lys His Ala Leu Val LyB Gly He Val Asp Tyr 
645 650 655 

He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

Glu Lys Ser Lys Asn CyB Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
755 760 765 

Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asp 
770 775 780 

Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 795 800 

His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

He Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 
820 825 830 

Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 

Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
865 870 875 880 

Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 90S 910 

Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 



WO 03/087386 PCT/EP03/04010 

232 

Gly Val Tyr Pro Gin lie Leu Glu Asp Glu Lye Val Gly Glu Glu Ala 
545 950 955 960 

Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He- Asp Ser 
965 970 975 

Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 

Ser He Gly Abp Asp He Phe Val Tyr Ala Asp Asp Glu Arg Ser He 
995 1000 1005 

He Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
1025 1030 1035 1040 

Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly He 
1045 1050 1055 

Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 106S 1070 

He Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 1100 

Glu He Leu Gly Asn Glu Glu Leu He Ala Glu Lys Tyr Arg Gly He 
1105 1110 ins H20 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 
1125 1130 H35 

He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 1145 H50 

Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 
1155 1160 H65 

Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys He Gly Lys 
1170 1175 1180 

Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
1185 1190 H95 1200 

Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 



<210> 53 

<211> 52 

<212> DNA 

<213> Kunstliche Sequenz 



<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 



<400> 53 

cccgggatcc gctagcggcg cgccggccgg cccggtgtga aataccgcac ag 52 
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<210> 54 
<211> 53 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 54 

tctagactcg agcggccgcg gccggccttt aaattgaaga cgaaagggcc teg 53 

<210> 55 
<211> 47 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunBtlichen Sequenz :PCR primer 
<400> 55 

gagatctaga cccggggatc cgctagcggg ctgetaaagg aagcgga 47 

<210> 56 
<211> 38 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz : PGR primer 
<400> 56 

gagaggegeg ccgctagcgt gggcgaagaa ctccagca 38 

<210> 57 
<211> 34 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 57 

gagagggegg ccgcgcaaag tcccgcttcg tgaa 34 

<210> 58 
<211> 34 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunBtlichen Sequenz :PCR primer 



<400> 58 

gagagggegg ccgctcaagt cggtcaagcc aege 



34 
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<210> 59 
<211> 140 
<212> DNA 

<2l?> Kflnstliche Seguenz 
<220> 

<223> Beschreibung der kunstlichen SequenzjPCR primer 
<400> 59 

tcgaatttaa atctcgagag gcctgacgtc gggcccggta ccacgcgtca < tatgactagt 60 
tcggacctag ggatatcgtc gacatcgatg ctcttctgcg ttaattaaca attgggatcc 120 
tctagacccg ggatttaaat 140 



<210> 60 
<211> 140 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 60 

gatcatttaa atcccgggtc tagaggatcc caattgttaa ttaacgcaga agagcatcga 60 
tgtcgacgat atccctaggt ccgaactagt catatgacgc gtggtaccgg gcccgacgtc 120 
aggcctctcg agatttaaat 140 



<210> 61 
<211> 33 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 61 

gagagcggcc gccgatcctt tttaacccat cac 33 



<210> 62 
<211> 32 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 62 

aggagcggcc gccatcggca ttttcttttg eg 32 



<210> 63 
<211> 5091 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz : Plasmid 
<400> 63 

gccgcgactg ccttcgcgaa gccttgcccc gcggaaattt cctccaccga gttcgtgcac 60 
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acccctatgc caagcttctt tcaccctaaa ttcgagagat tggattctta ccgtggaaat 120 
tcttcgcaaa aatcgtcccc tgatcgccct tgcgacgttg gcgtcggtgc cgctggttgc 180 
gcttggcttg accgacttga tcagcggccg ctcgatttaa atctcgagag gcctgacgtc 240 
gggcccggta ccacgcgtca tatgactagt tcggacctag ggatatcgtc gacatcgatg 300 
ctcttctgcg ttaattaaca attgggatcc tctagacccg ggatttaaat cgctagcggg 360 
ctgctaaagg aagcggaaca cgtagaaagc cagtccgcag aaacggtgct gaccccggat 420 
gaatgtcagc tactgggcta tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt 480 
agcttgcagt gggcttacat ggcgatagct agactgggcg gttttatgga cagcaagcga 540 
accggaattg ccagctgggg cgccctctgg taaggttggg aagccctgca aagtaaactg 600 
gatggctttc ttgccgccaa ggatctgatg gcgcagggga tcaagatctg atcaagagac 660 
aggatgagga tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc 720 
ttgggtggag aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc 780 
cgccgtgttc cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc 840 
cggtgccctg aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg 900 
cgttccttgc gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt 960 
gggegaagtg ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc 1020 
catcatggct gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga 1080 
ccaccaagcg aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga 1140 
tcaggatgat ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct 1200 
caaggcgcgc atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc 1260 
gaatatcatg gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt 1320 
ggcggaccgc tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg 1380 
cgaatgggct gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat 1440 
cgccttctat cgccttcttg acgagttctt ctgagcggga ctctggggtt cgaaatgacc 1500 
gaccaagcga cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa 1560 
aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat 1620 
ctcatgctgg agttcttcgc ccacgctagc ggcgcgccgg ccggcccggt gtgaaatacc 1680 
gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct cgctcactga 1740 
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 1800 
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 1860 
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 1920 
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 1980 
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 2040 
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 2100 
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 2160 
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 2220 
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 2280 
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 2340 
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 2400 
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 2460 
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 2520 
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 2580 
cttcacctag atccttttaa aggccggccg cggccgcgca aagtcccgct tcgtgaaaat 2640 
tttcgtgccg cgtgattttc cgccaaaaac tttaacgaac gttcgttata atggtgtcat 2700 
gaccttcacg acgaagtact aaaattggcc cgaatcatca gctatggatc tctctgatgt 2760 
cgcgctggag tccgacgcgc tcgatgctgc cgtcgattta aaaacggtga tcggattttt 2820 
ccgagctctc gatacgacgg acgcgccagc atcacgagac tgggccagtg ccgcgagcga 2880 
cctagaaact ctcgtggcgg atcttgagga gctggctgac gagctgcgtg ctcggccagc 2940 
gccaggagga cgcacagtag tggaggatgc aatcagttgc gcctactgcg gtggcctgat 3000 
ccctccccgg cctgacccgc gaggacggcg cgcaaaatat tgctcagatg cgtgtcgtgc 3060 
cgcagccagc cgcgagcgcg ccaacaaacg ccacgccgag gagctggagg cggctaggtc 3120 
gcaaatggcg ctggaagtgc gtcccccgag cgaaattttg gccatggtcg tcacagagct 3180 
ggaagcggca gcgagaatta tcgcgatcgt ggcggtgccc gcaggcatga caaacatcgt 3240 
aaatgccgcg tttcgtgtgc cgtggccgcc caggacgtgt cagcgccgcc accacctgca 3300 
ccgaatcggc agcagcgtcg cgcgtcgaaa aagcgcacag gcggcaagaa gcgataagct 3360 
gcacgaatac ctgaaaaatg ttgaacgccc cgtgagcggt aactcacagg gcgtcggcta 3420 
acccccagtc caaacctggg agaaagcgct caaaaatgac tctagcggat tcacgagaca 3480 
ttgacacacc ggcctggaaa ttttccgctg atctgttcga cacccatccc gagctcgcgc 3540 
tgcgatcacg tggctggacg agcgaagacc gccgcgaatt cctcgctcac ctgggcagag 3600 
aaaatttcca gggcagcaag acccgcgact tcgccagcgc ttggatcaaa gacccggaca 3660 
cggagaaaca cagccgaagt tataccgagt tggttcaaaa tcgcttgccc ggtgccagta 3720 
tgttgctctg acgcacgcgc agcacgcagc cgtgcttgtc ctggacattg atgtgccgag 3780 
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ccaccaggcc ggcgggaaaa tcgagcacgt aaaccccgag gtctacgcga ttttggagcg 3840 
ctgggcacgc ctggaaaaag cgccagcttg gatcggcgtg aatccactga gcgggaaatg 3900 
ccagctcatc tggctcattg atccggtgta tgccgcagca ggcatgagca gcccgaatat 3960 
gcgcctgctg gctgcaacga ccgaggaaat gacccgcgtt ttcggcgctg accaggcttt 4020 
ttcacatagg ctgagccgtg gccactgcac tctccgacga tcccagccgt accgctggca 4060 
tgcccagcac aatcgcgtgg atcgcctagc tgatcttatg gaggttgctc gcatgatctc 4140 
aggcacagaa aaacctaaaa aacgctatga gcaggagttt tctagcggac gggcacgtat 4200 
cgaagcggca agaaaagcca ctgcggaagc aaaagcactt gccacgcttg aagcaagcct 4260 
gccgagcgcc gctgaagcgt ctggagagct gatcgacggc gtccgtgtcc tctggactgc 4320 
tccagggcgt gccgcccgtg atgagacggc ttttcgccac gctttgactg, tgggatacca 4380 
gttaaaagcg gctggtgagc gcctaaaaga caccaagggt catcgagcct acgagcgtgc 4440 
ctacaccgtc gctcaggcgg tcggaggagg ccgtgagcct gatctgccgc cggactgtga 4500 
ccgccagacg gattggccgc gacgtgtgcg cggctacgtc gctaaaggcc agccagtcgt 4560 
ccctgctcgt cagacagaga cgcagagcca gccgaggcga aaagctctgg ccactatggg 4620 
aagacgtggc ggtaaaaagg ccgcagaacg ctggaaagac ccaaacagtg agtacgcccg 4680 
agcacagcga gaaaaactag ctaagtccag tcaacgacaa gctaggaaag ctaaaggaaa 4740 
tcgcttgacc attgcaggtt ggtttatgac tgttgaggga gagactggct cgtggccgac 4800 
aatcaatgaa gctatgtctg aatttagcgt gtcacgtcag accgtgaata gagcacttaa 4860 
ggtctgcggg cattgaactt ccacgaggac gccgaaagct tcccagtaaa tgtgccatct 4920 
cgtaggcaga aaacggttcc cccgtagggt ctctctcttg gcctcctttc taggtcgggc 4980 
tgattgctct tgaagctctc taggggggct cacaccatag gcagataacg ttccccaccg 5040 
gctcgcctcg taagcgcaca aggactgctc ccaaagatct tcaaagccac t 5091 



<210> 64 
<211> 4323 
<212> DNA 

<213> Kilns tliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz : Plasmid 
<400> 64 

tctctcagcg tatggttgtc gcctgagctg tagttgcctt catcgatgaa ctgctgtaca 60 
ttttgatacg tttttccgtc accgtcaaag attgatttat aatcctctac accgttgatg 120 
ttcaaagagc tgtctgatgc tgatacgtta acttgtgcag ttgtcagtgt ttgtttgccg 180 
taatgtttac cggagaaatc agtgtagaat aaacggattt ttccgtcaga tgtaaatgtg 240 
gctgaacctg accattcttg tgtttggtct tttaggatag aatcatttgc atcgaatttg 300 
tcgctgtctt taaagacgcg gccagcgttt ttccagctgt caatagaagt ttcgccgact 360 
ttttgataga acatgtaaat cgatgtgtca tccgcatttt taggatctcc ggctaatgca 420 
aagacgatgt ggtagccgtg atagtttgcg acagtgccgt cagcgttttg taatggccag 480 
ctgtcccaaa cgtccaggcc ttttgcagaa gagatatttt taattgtgga cgaatcaaat 540 
tcagaaactt gatatttttc atttttttgc tgttcaggga tttgcagcat atcatggcgt 600 
gtaatatggg aaatgccgta tgtttcctta tatggctttt ggttcgtttc tttcgcaaac 660 
gcttgagttg cgcctcctgc cagcagtgcg gtagtaaagg ttaatactgt tgcttgtttt 720 
gcaaactttt tgatgttcat cgttcatgtc tcctttttta tgtactgtgt tagcggtctg 780 
cttcttccag ccctcctgtt tgaagatggc aagttagtta cgcacaataa aaaaagacct 840 
aaaatatgta aggggtgacg ccaaagtata cactttgccc tttacacatt ttaggtcttg 900 
cctgctttat cagtaacaaa cccgcgcgat ttacttttcg acctcattct attagactct 960 
cgtttggatt gcaactggtc tattttcctc ttttgtttga tagaaaatca taaaaggatt 1020 
tgcagactac gggcctaaag aactaaaaaa tctatctgtt tcttttcatt ctctgtattt 1080 
tttatagttt ctgttgcatg ggcataaagt tgccttttta atcacaattc agaaaatatc 1140 
ataatatctc atttcactaa ataatagtga acggcaggta tatgtgatgg gttaaaaagg 1200 
atcggcggcc gctcgattta aatctcgaga ggcctgacgt cgggcccggt accacgcgtc 1260 
atatgactag ttcggaccta gggatatcgt cgacatcgat gctcttctgc gttaattaac 1320 
aattgggatc ctctagaccc gggatttaaa tcgctagcgg gctgctaaag gaagcggaac 1380 
acgtagaaag ccagtccgca gaaacggtgc tgaccccgga tgaatgtcag ctactgggct 1440 
atctggacaa gggaaaacgc aagcgcaaag agaaagcagg tagcttgcag tgggcttaca 1500 
tggcgatagc tagactgggc ggttttatgg acagcaagcg aaccggaatt gccagctggg 1560 
gcgccctctg gtaaggttgg gaagccctgc aaagtaaact ggatggcttt cttgccgcca 1620 
aggatctgat ggcgcagggg atcaagatct gatcaagaga caggatgagg atcgtttcgc 1680 
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atgattgaac aagatggatt gcacgcaggt tctccggccg cttgggtgga gaggctattc 1740 
ggctatgact gggcacaaca gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca 1800 
gcgcaggggc gcccggttct ttttgtcaag accgacctgt ccggtgccct gaatgaactg 1860 
caggacgagg cagcgcggct atcgtggctg gccacgacgg gcgttccttg cgcagctgtg 1920 
ctcgacgttg tcactgaagc gggaagggac tggctgctat tgggcgaagt gccggggcag 1980 
gatctcctgt catctcacct tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg 2040 
cggcggctgc atacgcttga tccggctacc tgcccattcg accaccaagc gaaacatcgc 2100 
atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg atcaggatga tctggacgaa 2160 
gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg catgcccgac 2220 
ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat 2280 
ggccgctttt ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac 2340 
atagcgttgg ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc 2400 
ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt 2460 
gacgagttct tctgagcggg actctggggt tcgaaatgac cgaccaagcg acgcccaacc 2520 
tgccatcacg agatttcgat tccaccgccg ccttctatga aaggttgggc ttcggaatcg 2580 
ttttccggga cgccggctgg atgatcctcc agcgcgggga tctcatgctg gagttcttcg 2640 
cccacgctag cggcgcgccg gccggcccgg tgtgaaatac cgcacagatg cgtaaggaga 2700 
aaataccgca tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 2760 
cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 2820 
ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 2880 
aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 2940 
cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 3000 
cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 3060 
gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 3120 
tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 3180 
cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 3240 
ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 3300 
gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 3360 
gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 3420 
accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 3480 
ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 3540 
tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 3600 
aaggccggcc gcggccgcca tcggcatttt cttttgcgtt tttatttgtt aactgttaat 3660 
tgtccttgtt caaggatgct gtctttgaca acagatgttt tcttgccttt gatgttcagc 3720 
aggaagctcg gcgcaaacgt tgattgtttg tctgcgtaga atcctctgtt tgtcatatag 3780 
cttgtaatca cgacattgtt tcctttcgct tgaggtacag cgaagtgtga gtaagtaaag 3840 
gttacatcgt taggatcaag atccattttt aacacaaggc cagttttgtt cagcggcttg 3900 
tatgggccag ttaaagaatt agaaacataa ccaagcatgt aaatatcgtt agacgtaatg 3960 
ccgtcaatcg tcatttttga tccgcgggag tcagtgaaca ggtaccattt gccgttcatt 4020 
ttaaagacgt tcgcgcgttc aatttcatct gttactgtgt tagatgcaat cagcggtttc 4080 
atcacttttt tcagtgtgta atcatcgttt agctcaatca taccgagagc gccgtttgct 4140 
aactcagccg tgcgtttttt atcgctttgc agaagttttt gactttcttg acggaagaat 4200 
gatgtgcttt tgccatagta tgctttgtta aataaagatt cttcgccttg gtagccatct 4260 
tcagttccag tgtttgcttc aaatactaag tatttgtggc ctttatcttc tacgtagtga 4320 
99a ~ 4323 



<210> 65 

<211> 35 

<212> DNA 

<213> PCR Primer 

<400> 65 

gagagagaga cgcgtcccag tggctgagac gcatc 35 

<210> 66 

<211> 34 

<212> DNA 

<213> PCR Primer 

<400> 66 

ctctctctgt cgacgaattc aatcttacgg cctg 34 



WO 03/087386 



238 

<210> 67 

<211> 38 

<212> DNA 

<213> PCR Primer 

<400> 67 

cggcaccacc gacatcatct tcacctgccc tcgttccg 



<210> 68 

<211> 38 

<212> DMA 

<213> PCR Primer 

<400> 68 

cggaacgagg gcaggtgaag atgatgtcgg tggtgccg 



<210> 69 

<211> 31 

<212> DNA 

<213> PCR Primer 

<400> 69 

gagactcgag ggaaggtgaa tcgaatttcg g 



<210> 70 

<211> 38 

<212> DNA 

<213> PCR Primer 

<400> 70 

gtcccgggga gaacgcacga ttctccaaaa ataatcgc 



<210> 71 

<211> 23 

<212> DNA 

<213> PCR Primer 

<400> 71 

gaatcgtgcg ttctccccgg gac 



<210> 72 

<211> 22 

<212> DNA 

<213> PCR Primer 

<400> 72 

gtagttgacc gagttgatca cc 



<210> 73 

<211> 18 

<212> DNA 

<213> PCR Primer 

<400> 73 

ccggcctgga gaagctcg 



<210> 74 

<211> 28 

<212> DNA 

<213> PCR Primer 

<400> 74 
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gagagatatc cctcagcggg cgttgaag 28 



<210> 


75 


<211> 


1266 


<212> 


DNA 


<213> 


LysC Mutante 


<220> 




<221> 


CDS 


<222> 


(1) . . (1266) 


<223> 




<400> 


75 



gaa ctt eta gaa ctt gca gcg gca gtg aat ccc gtt ccg cca get cgt 

Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 

gaa atg gat atg etc ctg act get ggt gag cgt att tct aac get etc 

Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg He Ser Asn Ala Leu 
65 70 75 eo 

gtc gee atg get att gag tec ctt ggc gca gaa gee caa tct ttc acg 

Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Phe Thr 



85 90 



95 



48 



96 



gtg gee ctg gtc gta cag aaa tat ggc ggt tec teg ctt gag agt gcg 
Val Ala Leu Val Val Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 
1 5 10 is 

gaa cgc att aga aac gtc get gaa egg ate gtt gee acc aag aag get 
Glu Arg He Arg Asn Val Ala Glu Arg He Val Ala Thr Lys Lys Ala 
20 25 30 

gga aat gat gtc gtg gtt gtc tgc tec gca atg gga gac acc acg gat 144 
Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 
35 40 45 



192 



240 



288 



ggc tct cag get ggt gtg etc acc acc gag cgc cac gga aac gca cgc 336 
Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg 
100 105 no 

att gtt gat gtc act cca ggt cgt gtg cgt gaa gca etc gat gag ggc 384 
He val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
H5 120 125 

aag ate tgc att gtt get ggt ttc cag ggt gtt aat aaa gaa acc cgc 432 
Lys He Cys He Val Ala Gly Phe Gin Gly val Asn Lys Glu Thr Arg 
130 135 140 

gat gtc acc acg ttg ggt cgt ggt ggt tct gac acc act gca gtt gcg 480 
Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 i 6 o 

ttg gca get get ttg aac get gat gtg tgt gag att tac teg gac gtt 528 
Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu He Tyr Ser Asp Val 
165 170 * 175 

gac ggt gtg tat acc get gac ccg cgc ate gtt cct aat gca cag aag 576 
Asp Gly Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 
180 185 190 

ctg gaa aag etc age ttc gaa gaa atg ctg gaa ctt get get gtt ggc 624 
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Leu Glu Lye Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 

tec aag att ttg gtg ctg cgc agt gtt gaa tac get cgt gca, ttc aat 672 
Ser Lye He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 

gtg cca ctt cgc gta cgc teg tct tat agt aat gat ccc ggc act ttg 720 
Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Aen Abp Pro Gly Thr Leu 
22 5 230 235 240 

att gee ggc tct atg gag gat att cct gtg gaa gaa gca gtc ctt acc 768 
He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 
245 250 255 

ggt gtc gca acc gac aag tec gaa gee aaa gta acc gtt ctg ggt att 816 
Gly Val Ala Thr Asp Lys Ser Glu Ala Lye Val Thr Val Leu Gly He 
260 265 270 

tec gat aag cca ggc gag get gcg aag gtt ttc cgt gcg ttg get gat 864 
Ser Asp Lye Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 

gca gaa ate aac att gac atg gtt ctg cag aac gtc tct tct gta gaa 912 
Ala Glu He Asn He Asp Met Val Leu Gin Asn Val Ser Ser Val Glu 
290 295 300 

gac ggc acc acc gac ate ate ttc acc tgc cct cgt tec gap ggc cgc 960 
Asp Gly Thr Thr Asp He He Phe Thr Cys Pro Arg Ser Asp Gly Arg 
305 310 315 320 

cgc gcg atg gag ate ttg aag aag ctt cag gtt cag ggc aac tgg acc 1008 
Arg Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 
325 330 a 335 

aat gtg ctt tac gac gac cag gtc ggc aaa gtc tec etc gtg ggt get 1056 
Asn Val Leu Tyr Asp Asp Gin Val Gly Lys Val Ser Leu Val Gly Ala 
340 345 350 

ggc atg aag tct cac cca ggt gtt acc gca gag ttc atg gaa get ctg 1104 
Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 

cgc gat gtc aac gtg aac ate gaa ttg att tec acc tct gag att cgt 1152 
Arg Asp Val Asn Val Asn He Glu Leu He Ser Thr Ser Glu He Arg 
370 375 380 

att tec gtg ctg ate cgt gaa gat gat ctg gat get get gca cgt gca 1200 
He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
385 390 395 400 

ttg cat gag cag ttc cag ctg ggc ggc gaa gac gaa gec gtc gtt tat 1248 
Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala Val Val Tyr 
405 410 415 

gca ggc acc gga cgc taa x2 $6 
Ala Gly Thr Gly Arg 
420 



<210> 76 
<2U> 421 
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<212> PRT 

<213> LysC Mutante 

<40O> 76 

Val Ala Leu Val Val Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 
15 10 15 

Glu Arg He Arg Asn Val Ala Glu Arg He Val Ala Thr Lys Lys Ala 
20 25 30 

Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 
35 40 45 

Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 

Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg He Ser Asn Ala Leu 
fi 5 70 75 80 

Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Phe Thr 
85 90 95 

Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg 
100 105 no 

lie Val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
115 120 ~ 125 

Lys He Cys He Val Ala Gly Phe Gin Gly Val Asn Lys Glu Thr Arg 
130 135 140 

Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 160 

Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu He Tyr Ser Asp Val 
165 170 175 

Asp Gly Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 
180 185 190 

Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 

Ser Lys He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 



Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu 
225 230 235 240 
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He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 
245 250 ,255 



Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly He 
260 265 270 



Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 



Ala Glu He Asn He Asp Met Val Leu Gln.Asn Val Ser Ser Val Glu 
290 295 300 



Asp Gly Thr Thr Asp He He Phe Thr Cys Pro Arg Ser Asp Gly Arg 
305 310 315 320 



Arg Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 
325 330 335 



Asn Val Leu Tyr Asp Asp Gin Val Gly Lys Val Ser Leu Val Gly Ala 
340 345 35Q 



Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 



Arg Asp Val Asn Val Asn He Glu Leu He Ser Thr Ser Glu He Arg 
370 375 380 



He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
385 390 395 400 



Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala Val Val Tyr 
405 410 415 



Ala Gly Thr Gly Arg 
420 



<210> 77 
<211> 5860 
<212> DNA 
<213> Plasmid 
<400> 77 

cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60 
agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120 
aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180 
cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240 



( 
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caagaaggct ggaaatgatg tcgtggttgt ctgctccgca atgggagaca ccacggatga 300 

acttctagaa cttgcagcgg cagtgaatcc cgttccgcca gctcgtgaaa tggatatgct 360 

cctgactgct ggtgagcgta tttctaacgc tctcgtcgcc atggctattg agtcccttgg 420 

cgcagaagcc caatctttca cgggctctca ggctggtgtg ctcaccaccg agcgccacgg 460 

aaacgcacgc attgttgatg tcactccagg tcgtgtgcgt gaagcactcg atgagggcaa 540 

gatctgcatt gttgctggtt tccagggtgt taataaagaa acccgcgatg tcaccacgtt 600 

gggtcgtggt ggttctgaca ccactgcagt tgcgttggca gctgctttga acgctgatgt 660 

gtgtgagatt tactcggacg ttgacggtgt gtataccgct gacccgcgca tcgttcctaa 720 

tgcacagaag ctggaaaagc tcagcttcga agaaatgctg gaacttgctg ctgttggctc 780 

caagattttg gtgctgcgca gtgttgaata cgctcgtgca ttcaatgtgc cacttcgcgt 840 

acgctcgtct tatagtaatg atcccggcac tttgattgcc ggctctatgg aggatattcc 900 

tgtggaagaa gcagtcctta ccggtgtcgc aaccgacaag tccgaagcca aagtaaccgt 960 

tctgggtatt tccgataagc caggcgaggc tgcgaaggtt ttccgtgcgt tggctgatgc 1020 

agaaatcaac attgacatgg ttctgcagaa cgtctcttct gtagaagacg gcaccaccga 1080 

catcaccttc acctgccctc gttccgacgg ccgccgcgcg atggagatct tgaagaagct 1140 

tcaggttcag ggcaactgga ccaatgtgct ttacgacgac caggtcggca aagtctccct 12O0 

cgtgggtgct ggcatgaagt ctcacccagg tgttaccgca gagttcatgg aagctctgcg 1260 

cgatgtcaac gtgaacatcg aattgatttc cacctctgag attcgtattt ccgtgctgat 1320 

ccgtgaagat gatctggatg ctgctgcacg tgcattgcat gagcagttcc agctgggcgg 1380 

cgaagacgaa gccgtcgttt atgcaggcac cggacgctaa agttttaaag gagtagtttt 1440 

acaatgacca ccatcgcagt tgttggtgca accggccagg tcggccaggt tatgcgcacc 1500 

cttttggaag agcgcaattt cccagctgac actgttcgtt tctttgcttc cccacgttcc 1560 

gcaggccgta agattgaatt cgtcgacatc gatgctcttc tgcgttaatt aacaattggg 1620 

atcctctaga cccgggattt aaatcgctag cgggctgcta aaggaagcgg aacacgtaga 1680 

aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg gctatctgga l 740 

caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt acatggcgat 1800 

agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct ggggcgccct I860 

ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct 1920 

gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg 1980 

aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg 2040 

actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg 2100 



WO 03/087386 PCT/EP03/04010 

244 

ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg 2160 

aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg 2220 

ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc 2280 

tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc 2340 

tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat. cgcatcgagc 2400 

gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc 2460 

aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg 2520 

atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct 25B0 

tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt 2640 

tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc 2700 

tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt 2760 

tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc 2820 

acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2880 

ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc 2940 

tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 3000 

gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 3060 

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 3120 

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 3180 

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3240 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3300 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3360 

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3420 

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3480 

ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3540 

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3600 

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 3660 

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3720 

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 3780 

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3840 

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg 3900 

gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt 3960 



( 
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gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc 4020 

tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata tagcttgtaa 4080 

tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat 4140 

cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc 4200 

cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta atgccgtcaa 4260 

tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga 4320 

cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt 4380 

ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag 4440 

ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc 4500 

ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc 4560 

cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc 4620 

tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt 4680 

gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca 4740 

aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat 4800 

gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg 4860 

aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc 4920 

tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg ccgacttttt 4980 

gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga 5040 

cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt 5100 

cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag 5160 

aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa 5220 

tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt 5280 

gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa 5340 

actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc 5400 

ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa agacctaaaa 5460 

tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg 5520 

ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt 5580 

tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca 5640 

gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta 5700 

tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa 5760 

tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg 5820 
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gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860 

<210> 78 
<2U> 5860 
<212> DNA 
<213> Plasmid 
<400> 78 

cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60 

agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120 

aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180 

cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240 

caagaaggct ggaaatgatg tcgtggttgt ctgctccgca atgggagaca ccacggatga 300 

acttctagaa cttgcagcgg cagtgaatcc cgttccgcca gctcgtgaaa tggatatgct 360 

cctgactgct ggtgagcgta tttctaacgc tctcgtcgcc atggctattg agtcccttgg 420 

cgcagaagcc caatctttca cgggctctca ggctggtgtg ctcaccaccg agcgccacgg 480 

aaacgcacgc attgttgatg tcactccagg tcgtgtgcgt gaagcactcg atgagggcaa 540 

gatctgcatt gttgctggtt tccagggtgt taataaagaa acccgcgatg tcaccacgtt 600 

gggtcgtggt ggttctgaca ccactgcagt tgcgttggca gctgctttga acgctgatgt 660 

gtgtgagatt tactcggacg ttgacggtgt gtataccgct gacccgcgca tcgttcctaa 720 

tgcacagaag ctggaaaagc tcagcttcga agaaatgctg gaacttgctg ctgttggctc 780 

caagattttg gtgctgcgca gtgttgaata cgctcgtgca ttcaatgtgc cacttcgcgt 840 

acgctcgtct tatagtaatg atcccggcac tttgattgcc ggctctatgg aggatattcc 900 

tgtggaagaa gcagtcctta ccggtgtcgc aaccgacaag tccgaagcca aagtaaccgt 960 

tctgggtatt tccgataagc caggcgaggc tgcgaaggtt ttccgtgcgt tggctgatgc 1020 

agaaatcaac attgacatgg ttctgcagaa cgtctcttct gtagaagacg gcaccaccga 1080 

catcatcttc acctgccctc gttccgacgg ccgccgcgcg atggagatct tgaagaagct 1140 

tcaggttcag ggcaactgga ccaatgtgct ttacgacgac caggtcggca aagtctccct 1200 

cgtgggtgct ggcatgaagt ctcacccagg tgttaccgca gagttcatgg aagctctgcg 1260 

cgatgtcaac gtgaacatcg aattgatttc cacctctgag attcgtattt ccgtgctgat 1320 

ccgtgaagat gatctggatg ctgctgcacg tgcattgcat gagcagttcc agctgggcgg 1380 

cgaagacgaa gccgtcgttt atgcaggcac cggacgctaa agttttaaag gagtagtttt 1440 

acaatgacca ccatcgcagt tgttggtgca accggccagg tcggccaggt tatgcgcacc 1500 

cttttggaag agcgcaattt cccagctgac actgttcgtt tctttgcttc cccacgttcc 1560 

gcaggccgta agattgaatt cgtcgacatc gatgctcttc tgcgttaatt aacaattggg 1620 
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atcctctaga cccgggattt aaatcgctag cgggctgcta aaggaagcgg aacacgtaga 1680 

aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg gctatctgga 1740 

caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt acatggcgat 1800 

agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct ggggcgccct 1860 

ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct 1920 

gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg 1980 

aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg 2040 

actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg 2100 

ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg 2160 

aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg 2220 

ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc 2280 

tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc 2340 

tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc 2400 

gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc 2460 

aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg 2520 

atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct 2580 

tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt 2640 

tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc 2700 

tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt 2760 

tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc 2820 

acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2880 

ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc 2940 

tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 3000 

gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 3060 

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 3120 

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 3180 

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3240 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3300 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3360 

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3420 

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3480 
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ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3540 

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3600 

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 3660 

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3720 

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa, aaaggatctc 3780 

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3840 

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg 3900 

gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt 3960 

gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc 4020 

tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata tagcttgtaa 4080 

tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat 4140 

cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc 4200 

cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta atgccgtcaa 4260 

tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga 4320 

cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt 4380 

ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag 4440 

ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc 4500 

ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc 4560 

cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc 4620 

tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt 4680 

gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca 4740 

aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat 4 800 

gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg 4860 

aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc 4920 

tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg ccgacttttt 4980 

gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga 5040 

cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt 5100 

cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag 5160 

aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa 5220 

tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt 5280 

gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa 5340 
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actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc 5400 

ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa agacctaaaa 5460 

tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg 5520 

ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt 5580 

tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca 5640 

gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta 5700 

tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa 5760 

tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg 5820 

gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860 

<210> 79 
<211> 8787 
<212> DNA 
<213> Plasmid 
<400> 79 

tcgagggaag gtgaatcgaa tttcggggct ttaaagcaaa aatgaacagc ttggtctata 60 

gtggctaggt accctttttg ttttggacac atgtagggtg gccgaaacaa agtaatagga 120 

caacaacgct cgaccgcgat tatttttgga gaatcgtgcg ttctccccgg gacgtcccac 180 

gacgggcggc accgggcaga ggcaaagccg acagccgtcg catcctaggg agccctttca 240 

tggcctcgtc gccatccacc ccgcccgccg acacccgcac ccgcgtgtcc gccctccgag 300 

aggccctcgc cacccgcgtg gtggtcgccg acggcgccat gggcaccatg ctccaggccc 360 

agaaccccac gctggacgac ttccagcagc tcgaagggtg caacgaggtc ctgaacctca 420 

cccggcccga catcgtccgc tcggtgcacg aggagtactt cgcggccggc gtcgactgcg 480 

tcgagaccaa caccttcggc gccaaccact ccgccctggg cgagtacgac atccccgagc 540 

gcgtccacga actgtccgag gccggcgccc gcgtcgcccg cgaggtcgcc gacgagttcg 600 

gcgcccgcga cggccggcag cgctgggtgc tgggctccat gggccccggc accaagctcc 660 

ccaccctcgg ccacgccccg tacaccgtcc tgcgcgacgc ctaccagcgc aacgccgagg 720 

gactggtcgc gggcggcgcg gacgcactgc tggtggagac cacgcaggac ctgctccaga 780 

ccaaggcctc ggtgctcggc gcccggcgcg ccctggacgt cctcggcctc gacctgccgc 840 

tcatcgtgtc cgtcaccgtc gagaccaccg gcaccatgct gctcggctcg gagatcggcg 900 

ccgcgctcac cgcgctggaa ccgctcggca tcgacatgat cggcctgaac tgcgccaccg 960 

gccccgccga gatgagcgag cacctgcgct acctcgcccg gcactcccgc atcccgctga 1020 

cctgcatgcc caacgccggt ctgcccgtcc tcggcaagga cggcgcccac tacccgctga 1080 

ccgcgcccga gctggccgac gcacacgaga ccttcgtgcg cgagtacggc ctgtccctgg 1140 
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tcggcggctg ctgcggcacc acgcccgagc acctgcgcca ggtcgtcgag cgggtccggg 1200 

acaccgcccc caccgcacgc gacccgcgcc ccgagcccgg cgccgcctcg ctctaccaga 1260 

ccgtgccctt ccgccaggac acctcctacc tggccatcgg cgagcgcacc aacgccaacg 1320 

ggtccaagaa gttccgcgag gccatgctgg acggccgctg ggacgactgc gtcgagatgg 1380 

cccgcgacca gatccgcgaa ggcgcgcaca tgctcgacct ctgcgtcgaq tacgtcggcc 1440 

gggacggcgt cgccgacatg gaggaactgg ccggccggtt cgccaccgcc tccacgctgc 1500 

cgatcgtcct cgactccacc gaggtcgacg tcatccgggc cggcctggag aagctcggcg 1560 

gccgcgcggt gatcaactcg gtcaactacg aggacggcgc cggccccgag tcccggttcg 1620 

cccgcgtcac gaagctcgcc cgggagcacg gcgccgcgct gatcgcgctg accatcgacg 1680 

aggtgggaca ggcccgcacc gccgagaaga aggtcgagat cgccgaacgg ctcatcgacg 1740 

acctcaccgg caactggggc atccacgagt ccgacatcct cgtcgactgc ctgaccttca 1800 

ccatctgcac cggccaggag gagtcccgca aggacggcct ggccaccatc gagggcatcc 1860 

gggaactcaa gcggcgccac ccggacgtgc agaccacgct cggcctgtcg aacatctcct 1920 

tcggcctcaa cccggccgcc cgcatcctgc tcaactccgt cttcctcgac gaatgcgtca 1980 

aggccggcct ggactcggcc atcgtgcacg cgagcaagat cctgccgatc gcccgcttcg 2040 

acgaggagca ggtcaccacc gccctcgact tgatctacga ccgccgccgc gagggctacg 2100 

accccctgca aaagctcatg cagctcttcg agggcgccac cgccaagtcg ctgaaggcct 2160 

ccaaggccga ggaactggcc gccctcccgc tggaggagcg cctcaagcgc cgcatcatcg 2220 

acggcgagaa gaacggcctc gaacaggacc tcgacgaggc cctccgggag cgcccggccc 2280 

tcgagatcgt caacgacacc ctgctcgacg gtatgaaggt cgtcggcgag ctgttcggct 2340 

ccggccagat gcagctgccg ttcgtgctcc agtccgccga ggtcatgaag accgcggtgg 2400 

cccacctgga gccgcacatg gagaagaccg acgacgacgg caagggcacg atcgtgctgg 2460 

ccaccgtccg cggcgacgtc cacgacatcg gcaagaacct cgtcgacatc atcctgtcca 2520 

acaacggcta caacgtcgtc aacctcggca tcaagcagcc cgtctccgcg atcctggaag 25 BO 

cggccgacga gcaccgggcc gacgtcatcg gcatgtccgg cctcctcgtc aagtccacgg 2640 

tgatcatgaa ggagaacctg gaggagctga accagcgcaa gctggccgcc gactacccgg 2700 

tcatcctcgg cggcgccgcc ctcaccaggg cctacgtcga acaggacctg cacgagatct 2760 

acgacggcga ggtccgctac gcccgcgacg ccttcgaggg cctgcgcctc atggacgccc 2820 

tcatcggcat caagcgcggc gtgcccggcg ccaagctgcc ggagctgaag cagcgccggg 2880 

tgcgggccgc caccgtcgag atcgacgagc gccccgagga aggccacgtc cgctccgacg 2940 

tcgccaccga caacccggtc ccgaccccgc ccttccgcgg cacccgcgtc gtcaagggca 3000 
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tccagctcaa ggagtacgcc tcctggctcg acgagggcgc cctcttcaag ggccagtggg 3060 
gcctcaagca ggcccgcacc ggcgagggac cctcctacga ggaactggtc gagtccgagg 3120 
gccggccgcg gctgcgcggc ctgctcgacc ggctccagac ggacaacctt ttggaggcgg 3180 
ccgtggtcta cggctacttc ccctgcgtct ccaaggacga cgacctgatc gtcctcgacg 3240 
acgacggcaa cgaacgcacc cgcttcacct tcccccgcca gcgccgcggc cggcgcctgt 3300 
gcctggccga cttcttccgc ccggaggagt ccggcgagac cgacgtggtc ggcttccagg 3360 
tcgtcaccgt cggctcccgc atcggcgagg agacggcccg catgttcgag gccaacgcct 3420 
accgcgacta tctcgagctg cacggcctgt ccgtgcagct cgccgaggcc ctcgccgagt 3480 
actggcacgc gcgcgtgcgc tcggaactcg gcttcgccgg ggaggacccg gccgagatgg 3540 

aggacatgtt cgccctgaag taccggggtg cccgcttctc cctcggctac ggcgcctgcc 3600 

ccgacctgga ggaccgcgcc aagatcgccg ccctgctgga gcccgagcgc atcggcgtcc 3660 

acctatccga ggagttccag ctccaccccg agcagtccac cgacgccatc gtcatccacc 3720 

acccggaggc caagtacttc aacgcccgct gagggatatc gtcgacatcg atgctcttct 3780 

gcgttaatta acaattggga tcctctagac ccgggattta aatcgctagc gggctgctaa 3840 

aggaagcgga acacgtagaa agccagtccg cagaaacggt gctgaccccg gatgaatgtc 3900 

agctactggg ctatctggac aagggaaaac gcaagcgcaa agagaaagca ggtagcttgc 3960 

agtgggctta catggcgata gctagactgg gcggttttat ggacagcaag cgaaccggaa 4020 

ttgccagctg gggcgccctc tggtaaggtt gggaagccct gcaaagtaaa ctggatggct 4080 

ttcttgccgc caaggatctg atggcgcagg ggatcaagat ctgatcaaga gacaggatga 4X40 

ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 4200 

gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 4260 

ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4320 

ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4 380 

tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4440 

gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4500 

gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4560 

gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4620 

gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4680 

cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4740 

atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4800 

cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4 860 
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gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4920 

tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4980 

cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 5040 

gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 5100 

tggagttctt cgcccacgct agcggcgcgc cggccggccc ggtgtgaaat.accgcacaga 5160 

tgcgtaagga gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg 5220 

cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 5280 

tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 5340 

aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 5400 

catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 5460 

caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 5520 

ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 5580 

aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 5640 

gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 5700 

cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 5760 

ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 5820 

tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 5880 

tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 5940 

cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 6000 

tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 6060 

tagatccttt taaaggccgg ccgcggccgc gcaaagtccc gcttcgtgaa aattttcgtg 6120 

ccgcgtgatt ttccgccaaa aactttaacg aacgttcgtt ataatggtgt catgaccttc 6180 

acgacgaagt actaaaattg gcccgaatca tcagctatgg atctctctga tgtcgcgctg 6240 

gagtccgacg cgctcgatgc tgccgtcgat ttaaaaacgg tgatcggatt tttccgagct 6300 

ctcgatacga cggacgcgcc agcatcacga gactgggcca gtgccgcgag cgacctagaa 6360 

actctcgtgg cggatcttga ggagctggct gacgagctgc gtgctcggcc agcgccagga 6420 

ggacgcacag tagtggagga tgcaatcagt tgcgcctact gcggtggcct gattcctccc 6480 

cggcctgacc cgcgaggacg gcgcgcaaaa tattgctcag atgcgtgtcg tgccgcagcc 6540 

agccgcgagc gcgccaacaa acgccacgcc gaggagctgg aggcggctag gtcgcaaatg 6600 

gcgctggaag tgcgtccccc gagcgaaatt ttggccatgg tcgtcacaga gctggaagcg 6660 

gcagcgagaa ttatcgcgat cgtggcggtg cccgcaggca tgacaaacat cgtaaatgcc 6720 
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gcgtttcgtg tgccgtggcc gcccaggacg tgtcagcgcc gccaccacct 


gcaccgaatc 


6780 


ggcagcagcg tcgcgcgtcg aaaaagcgca caggcggcaa gaagcgataa 


gctgcacgaa 


6640 


tacctgaaaa atgttgaacg ccccgtgagc ggtaactcac agggcgtcgg 


ctaaccccca 


6900 


gtccaaacct gggagaaagc gctcaaaaat gactctagcg gattcacgag 


acattgacac 


6960 


accggcctgg aaattttccg ctgatctgtt cgacacccat cccgagctcg 


cgctgcgatc 


7020 


acgtggctgg acgagcgaag accgccgcga attcctcgct cacctgggca 


gagaaaattt 


7080 


ccagggcagc aagacccgcg acttcgccag cgcttggatc aaagacccgg 


acacggagaa 


7140 


acacagccga agttataccg agttggttca aaatcgcttg cccggtgcca 


gtatgttgct 


7200 


ctgacgcacg cgcagcacgc agccgtgctt gtcctggaca ttgatgtgcc 


gagccaccag 


7260 


gccggcggga aaatcgagca cgtaaacccc gaggtctacg cgattttgga 


gcgctgggca 


7320 


cgcctggaaa aagcgccagc ttggatcggc gtgaatccac tgagcgggaa 


atgccagctc 


7380 


atctggctca ttgatccggt gtatgccgca gcaggcatga gcagcccgaa 


tatgcgcctg 


7440 


ctggctgcaa cgaccgagga aatgacccgc gttttcggcg ctgaccaggc 


tttttcacat 


7500 


aggctgagcc gtggccactg cactctccga cgatcccagc cgtaccgctg 


gcatgcccag 


7560 


cacaatcgcg tggatcgcct agctgatctt atggaggttg ctcgcatgat 


ctcaggcaca 


7620 


gaaaaaccta aaaaacgcta tgagcaggag ttttctagcg gacgggcacg 


tatcgaagcg 


7680 


gcaagaaaag ccactgcgga agcaaaagca cttgccacgc ttgaagcaag 


cctgccgagc 


7740 


gccgctgaag cgtctggaga gctgatcgac ggcgtccgtg tcctctggac 


tgctccaggg 


7800 


cgtgccgccc gtgatgagac ggcttttcgc cacgctttga ctgtgggata 


ccagttaaaa 


7860 


gcggctggtg agcgcctaaa agacaccaag ggtcatcgag cctacgagcg 


tgcctacacc 


7920 


gtcgctcagg cggtcggagg aggccgtgag cctgatctgc cgccggactg 


tgaccgccag 


7980 


acggattggc cgcgacgtgt gcgcggctac gtcgctaaag gccagccagt 


cgtccctgct 


8040 


cgtcagacag agacgcagag ccagccgagg cgaaaagctc tggccactat 


gggaagacgt 


8100 


ggcggtaaaa aggccgcaga acgctggaaa gacccaaaca gtgagtacgc 


ccgagcacag 


8160 


cgagaaaaac tagctaagtc cagtcaacga caagctagga aagctaaagg 


aaatcgcttg 


8220 


accattgcag gttggtttat gactgttgag ggagagactg gctcgtggcc 


gacaatcaat 


8280 


gaagctatgt ctgaatttag cgtgtcacgt cagaccgtga atagagcact 


taaggtctgc 


8340 


gggcattgaa cttccacgag gacgccgaaa gcttcccagt aaatgtgcca 


tctcgtaggc 


8400 


agaaaacggt tcccccgtag ggtctctctc ttggcctcct ttctaggtcg 


ggctgattgc 


8460 


tcttgaagct ctctaggggg gctcacacca taggcagata acgttcccca 


ccggctcgcc 


8520 


tcgtaagcgc acaaggactg ctcccaaaga tcttcaaagc cactgccgcg 


actgccttcg 


8580 
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cgaagccttg ccccgcggaa atttcctcca ccgagttcgt gcacacccct atgccaagct 8640 

tctttcaccc taaattcgag agattggatt cttaccgtgg aaattcttcg paaaaatcgt 8700 

cccctgatcg cccttgcgac gttggcgtcg gtgccgctgg ttgcgcttgg cttgaccgac 8760 

ttgatcagcg gccgctcgat ttaaatc 8787 
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